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THE EQUATIONS OF MOTION OF THE MOON. 


By Ernest W. Brown.* 


1. In Volume 1 of this Journal, G. W. Hill published a paper entitled 


“Researches in the Lunar Theory ” 


which has had far-reaching effects on the 
later development of the subject. From a study of it H. Poincaré conceived 
and developed the idea of the periodic orbit which in his hands became the 
basis for an extended study of the mathematics of the problem of three bodies. 
The calculation and classification of orbits of the type studied by Hill have 
formed the subject of many papers by various authors since his time. In my 
own case, it formed the starting point for a complete recalculation of the aciion 
of the sun on the motion of the moon. It is mainly the methods used in this 
last work that I am going to reconsider here. 

It is not wise, I believe, to imagine that the work of a great master in his 
subject cannot be improved, especially when extended calculations are involved. 
As time goes on, new techniques are developed and in our own time calculating 
machines have come within the reach of everyone, so that a method previously 
rejected on account of the extended computation required may become easy 
and simple with but slight changes. My own experience in several cases leads 
me to think that it is worth while to return to the foundations even when they 
go so far back in time as those of Celestial Mechanics, and to examine whether 


valuable changes cannot be made. 


2. The present paper is the result of such an examination. In my calcu- 
lation, I adopted Hill’s equations as they stood merely adding the terms in 
the disturbing function which complete accuracy demanded and adapting the 
solution of the equations to the additional terms. Two or more years ago it 
occurred to me to try the effect of referring the motion to axes which followed 
the mean position of the moon. JI found later that Euler had had the seme 
idea in his second theory. Hill had used axes which followed the mean 
position of the sun. There was a reason for this change as the codrdinates 
would then take the form a+ 2, y where x, y are small and therefore ex- 
pansions in powers of z, y were possible. What I did not expect, however, was 
that the equations for x, y could be put into a form such that the first gave 
tand then the second gave y with the same degree of approximation that had 

* Professor Brown, who successfully undertook the great task of developing and 
applying Hill’s theory of the Moon’s motion, died on July 22, 1938, some days after 


tevising the proof of the present paper. Eprrors. 
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786 ERNEST W. BROWN. 
been obtained before by the troublesome solution of two simultaneous equa- 
tions. Further, the homogenous equation for x had fewer large terms present 
than the earlier equation which it superseded, that for y being substantially the 
same as before. 

A further advantage in calculation is due to the fact that the lowest power 
of n’ or m present is the second; in Hill’s equations the first power occurs. 
Finally they have permitted me to find out the loss of accuracy due to small 
divisors which has occurred through the whole theory. This last feature is 
developed below and leads to the satisfactory result that the loss is considerably 
less than I had supposed. This fact has already appeared in the verification 
now under way. 

When e’ = 0, the time disappears explicitly from Hill’s equations. This 
property has considerable advantage in theoretical researches and in the search 
for periodic orbits. But it is of no advantage in the calculation of satellite 
orbits. On the contrary it is a disadvantage in the latter case because the 
elimination of ¢ deprives the equations of the advantages mentioned above for 


the new form. 


3. The next step was an attempt to abbreviate the calculation of the 
various terms in the equations. In the so-called homogeneous form, if we 
neglect the small ratio of the distances, every term is a second degree term 
in x, y, z or their derivatives. As the codrdinates consist of series of harmonic 
terms, by far the greatest part of the work consists of the multiplication of 
pairs of harmonic series. If then a technique could be developed for the multi- 
plication of harmonic series by machinery practically the whole of the work 
of calculating the action of the sun on the moon could be done in perhaps a 
tenth of the time it originally required. I was fortunate in interesting my 
friend and former pupil, Professor W. J. Eckert of Columbia University, in 
this problem. He had already adapted several commercial machines of the 
Hollerith type to the solution of problems of celestial mechanics, and had 
developed a computing laboratory for their effective use. With the aid of the 
facilities of this laboratory we have been able in a year or two to test and 
extend calculations which took me nearly twenty years to carry out with the 
old-fashioned methods. These facts are mentioned bevause we are using the 
new equations in this verification. 

4, The equations have also been used in another connection, namely, the 


exact calculation? of a portion of the motion of the perigee which had been 
neglected as insensible in the earlier work, but which I had found by another 


1 Monthly Notices of the Royal Astronomical Society, vol. 97 (1936), pp. 116-127 


me 

pla 

we 

of | 

a 

firs 

for 

mea 

In 
| 1 
whe 

Na? 

in tl 
ot x 

| Hen 

first 

r+ 

know 


THE EQUATIONS OF MOTION OF THE MOON. 


method ° must be large enough to account for most of the previously unex- 
plained difference between the observed and theoretical values. 

These equations appear to be the best to adopt for satellite motion when 
we can use the variation orbit as a first approximation and expand in powers 
of the eccentricities, inclination and ratio of the distances. For a satellite with 
a large eccentricity like Jupiter VIII which seems to require the ellipse as a 
frst approximation, I hope to give elsewhere a method which appears to have 
considerable advantages over that which Professor Brouwer and I have used 
for this body and for which I was responsible. 


THE Non-HOMOGENEOUS EQUATIONS. 


5. The equations of motion referred to an z-axis in the direction of the 
mean position of the moon, longitude nt -+ «, the y-axis perpendicular to it 
in a fixed plane and the z-axis perpendicular to this plane are: 


2ny + + 27) 
5.1 ij + 2né 
) + 2na + y n ) dy? 
; 


where a(1 + 2), ay, az are the codrdinates of the moon, ar the distance and 
Qa? the disturbing function due to the sun, supposed to move in an ellipse 
in the vy-plane. 

When the moon is undisturbed and moves in a circular orbit in the plane 


of zy we have 


= == 0), f= 1, a= ad, = n?, 


Hence the factor in brackets in the first two equations is small. 
If we add the integral of the second equation multiplied by 2n to the 


first we obtain an equation which may be written 


t+ (1+ 2) ant) + 2n f —y (. a8 n*) dt + const. 


Since 


u/a*r? differs from n? by a small quantity of at least the first order and we 


know that 2 contains a factor of the second order ; this equation therefore gives. 


x +. = terms of the second order + const. 


*Tbid., vol. 97 (1936), pp. 56-66. 
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provided the integral introduces no small divisors of the first order. The 
point of this equation is the fact that y or z are not present as first order terms, 
The equation can therefore be solved to give an approximation to z. 

Since the last term on the left is of the second order, the second equation 
of 5.1 evidently gives y when we know x. The third equation gives z in the 
same manner as the first gives a. 


6. The actual procedure is so similar to that used in my Theory of the 
Motion of the Moon, that it is not necessary to repeat the various steps here. 
The chief difference is due to the fact that the various approximations to 2, y 
are obtained separately instead of together by the solution of a pair of simul- 
taneous equations as in the work mentioned. It may be noted that the 
a(1+ 2x) + ay of this paper is the same as the uf of the earlier work. 

As the form of © differs somewhat I shall give it completely below, as far 
as it is needed. 

We have * 


where 
14 


43 
= (1+ 2)?’ cos & — nt —e+ LH’) 
+ yr’ sin + & —nt—e+ 
te, 


where n/t + ¢, L” are the mean longitude and equation of the center of the 
sun’s orbit. In 6.1, #, M are the masses of the earth and moon. 
In order to expand these in powers of e’ it will be found convenient to 


use complex quantities by putting 
1+2—w=s, 


E’, are functions of I’ = n’t + ¢-—o’, an angle which disappears when 
e’=(. The expansions of the various functions of 7”, HZ’ needed will the» 
be found in T. M. M.*4 
We also put 
m’ n’2 E+M 
™ a n2 (1 q) m (1 q); q E+M-+m 


8’ Brown, Lunar Theory, Chap. I. 
‘These letters are used to denote my “Theory of the motion of the Moon,” pub- 
lished in vols. 53 and 59 of the Memoirs of the Royal Astronomical Society. 
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THE EQUATIONS OF MOTION OF THE MOON. 789 


If we are using numerical values, that of g can be inserted at the outset. Other- 
wise we put g = 0 and make a correction for it at the end. 


THE HOMOGENEOUS EQUATIONS. 
7. Multiply the equations 5.1 by 2, ¥, 2 respectively, add and integrate. 
Noticing that ¢ occurs in Q, through the codrdinates z, y, z as well as explicitly 


‘and denoting this explicit 


where it is shown and where it is present in L’, r 
occurrence by 00/0t, we have 


e+ #) 2)’ fF dt const. 


Now ¢ occurs explicitly through D=nt+e—n’l—é and through 
nt + &—w’ =I’. We have therefore’ 


But 

(1+ 2) oy sin(D — H’) cos(D 
Hence 


this giving the definition of Q. The right-hand member of 7.1 is therefore 


2—Q(n—n)—v dt +- const. 
The integral vanishes with ¢ 
Again, multiply equations 5.1 by 1+ 2, y, z respectively and add; w 
obtain a result which may be written 


02 


where Q in the right-hand member is supposed to be expressed in terms of r, 9. 
Adding this to 7.1, we note that the negative power of r disappears and 
we get 


in 

op 


ERNEST W. BROWN. 


By multiplying the first two equations of 5.1 by —y, 1 + 2, and adding, 
we get a second equation independent of negative powers of r. This may be 


written after integration 
7.5 =@ -+ const. 


Finally, multiplying this by 2m and adding the result to the previous equation 


we obtain 


+ Q(n-+n’) f dt +- const. 


con’ 


pow 


perl 


alw: 


For calculation subtract from each side of the equation— $m? (a? + 2z’). 


to 
The right-hand member will then contain no term free from one or both of 
eXCe 
the explicit angles 2D, I’ except a small term depending on the square of the 
mai 
ratio of the parallaxes. 
To get an equation for z, we subtract the first of 5.1 multiplied by z from 
the third multiplied by (1+ z) and obtain 
cont 
02 , 02 
77 (1+ 2) (z+ n’z) (1+ == (), in 
zdz dx f 
mig 
The equations 7.6, 7.5, 7.7 are those needed. It will be noticed that if in t 
we express the codrdinate 1 + 2 by a single letter and confine © to the portion each 
w2, the equations are homogeneous and of the second degree in «, y, z apart pro¢ 


from the arbitrary constants. 
8. Constants. The constants of the theory are found as in T. M. M., are 


except perhaps the constant a. It is best to leave this to the end of the work 9). 
and find it from the first of equations 5.1, or 7.3, the latter having the fron 
advantage of also giving 1/7. In substituting the results obtained for 2, y, z 
therein, I recall that here z is defined to contain no constant term. It con- tern 
sists entirely of cosines of multiples of the angles D, 1, I’, 2F, while y consisis Nov 
of sines of the same angles. In the codrdinate z the multiples of F’ present m?. 
are odd, and the harmonics are sines. as b 
9. Loss of accuracy due to small divisors. The approximate para- as t] 
meters are m = 1/13, e = 1/20, y = sin = 1/22, = 1/60, a/a’ = 1/400. the 
The effects of the first three of these generally will be discussed as though each We ¢ 
of them was 1/10. l— 
Small divisors may arise in the x, z equations from terms which have 4 can. 


period of a month, and in the y equation from terms of periods long com- 
pared with a month. The latter will also arise in Q so that y may involve addi 


the square of the small divisor. net | 
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Small divisors are of two kinds. Both of them are independent of the 
mean motion of the moon. Those containing the mean motion of the sun will 
have the factor m and those independent of it the factor m?. Examples of 
the former are 2D —1, D, lI’, 2D — 2l, and of the latter, 2h —2l, D+ TI, 
D+l’—l. It will be noticed that each long period term is associated with 
two short period terms obtained by adding and subtracting / from its argument. 

Suppose that we determine at any one step, as in T. M. M., the terms 
containing a given characteristic, i.e., those which have as a factor any given 
powers of e, y, e’,a/a’. In any such group there are either monthly or long 
period terms but not both. This is because the arguments in any such group 
always differ by even multiples of D, 1, I’, F. Thus as we proceed from lower 
to higher characteristics, short period and long period terms occur alternately, 
except as far as multiples of I’ are concerned. But as e’ = 1/60, and the 
main argument proceeds by decimal places, the additional factor 1/6 takes 
care of accumulations and no special treatment is necessary. 

Let us first take the cases of divisors which have m as a factor. Since Q 
contains the explicit factor m? there is no loss of accuracy from these terms 
in even when a double integration is involved. Thus the only terms which 
might cause a loss are those in 4? + y? + 2? in the z-equation, those in ry — yx 
in the y-equation, and the similar terms in the z-equation. But in such terms 
each of the coefficients is at the highest .1, so that no loss occurs when the 
product is divided by a number of the order .1. 

Of the terms which have divisors with a factor m? the most important 
are the long period terms with arguments 2/—2l, 2D-+ 2l’—2l, 
2D + 2l’ — 2F and the associated monthly terms with arguments which differ 
from these by 

For the first, the term with the lowest order characteristic is the monthly 
term with the argument 2/ — 1, and this is therefore the first to be computed. 
Now in the z-equation (7.6), the terms on the right have the explicit factors 
m*, so that no loss arises from these. In the second degree terms on the left 
és before, each factor is at most .1 so that one place is lost from these. But 
as the term arises mainly from the terms with argument l1, 2F' in a, y and as 
the term with argument 2F can be taken to an extra place without difficulty 
we can avoid this loss. There is also a combination from 2° with arguments 
I—F, — F, but the former can also be taken to an extra place. Hence we 
can avoid all loss for the term with argument 2Ff — I. 

The next in order is the long period term with argument 2/'— 2/. The 
additional part of the characteristic takes care of one decimal place, and the 
het loss is therefore one place. 

In the succeeding monthly term there is no additional loss, because the 
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only place where this might occur is in z? + y? and this involves the deriva- 
tives of the long period terms. 

The second group of terms starts with the monthly term having the 
argument 2) + 2/’—1. The only combination in the second order terms in 
which accuracy is lost is that with arguments 2D + 2I’,1. But the term with 
the former argument having a coefficient of the order .0001 can be taken to 
an additional place without trouble, so that loss of accuracy can be wholly 
avoided. In the succeeding long period term with argument 2D +- 2/’ — 2] 
we lose one place. With the monthly term 2D + 2l’ — 31 no accuracy is lost. 

The next term which needs consideration is that with argument D + /’—] 
and having the characteristic ee’a/a’ so that it is of the fifth order. The pre- 
ceding monthly term with argument D+’ can be found without loss of 
accuracy because neither of the terms which might cause it is so great as .01. 
Hence proceeding as before we see that with this group only one place is lost. 

Finally for the term 2D + 2l/’-— 2F which is the first of the group, the 
chief losses occur from the terms with argument 2D + 2l’, 2F in a, y and 
2D + 2’ —F,Finz As 2D-+ 2l’—F inz is a monthly term with a divisor 
m? there would be a loss if we do not take the term 2D + 2l’ in z to an extra 
place, which we can. Hence z can be obtained without loss and the term with 
argument 2D + 2l’— 2F with a loss of one place. There is no loss with the 
succeeding short period terms. 

Incidentally it may be noticed that there is no loss of accuracy in the 
codrdinate x due to a preceding short period or long period term, and that 
the only loss of accuracy is in y when the division is of order m?. 

Hence in the whole theory, it is not now difficult to see that the maximum 
loss can be reduced to two places of decimals. As I started in T. M. M. with 
11 places of decimals, it is probable that as far as the small divisors are con- 
cerned the theory is correct to 0001 in longitude. This result is considerably 
better than I had hoped. With the equations actually used it was not easy to 
limit the apparent losses in the manner shown above. They may, however, 
have been avoided in the course of the arithmetic without the computer being 
aware of the fact. The verification to which reference was made earlier has 
proceeded sufficiently far to enable me to state that amongst the terms con- 
taining even multiples of D, #—about half of the whole number — there 
appears to be no error in the earlier work greater than 001 in longitude, on 
the assumption that the transformation from rectangular to polar codrdinates 
has been correctly computed. Two of the terms containing odd multiples of 
D apparently have errors of ”.03, ”.04. But it is not certain whether these are 
apparent only, some further work being necessary to decide the matter. 
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SUR LES AVANCES DU NOEUD ET DU PERIHELIE 
D'UNE PLANETE SOUS L’ACTION D’UNE ANNEAU CIRCULAIRE. 


Par JEAN CHAZY, Paris. 


1. Dans mon Livre La théorie de la Relativité et la Mécanique célesle, 
pour discuter les différentes explications possibles de V’avance observée du 
périhélie de Mercure par rapport a la théorie des grosses planétes, j’ai étudié ? 
Vaction séculaire d’un anneau circulaire et homogéne ayant pour centre le corps 
central, sur les mouvements du nceud et du périhélie d’une planéte: et l’on sait, 
selon le principe de la méthode de Gauss, que l’action d’une autre planécte 
animée d’un mouvement circulaire ? équivaut a l’action d’un anneau circulaire 
et homogéne. J’ai appliqué la méthode de calcul classique fondée sur les 
éléments osculateurs et le systéme différentiel de Lagrange. 

Dans un Mémoire posthume * qui rassemble ses recherches de Mécanique 
céleste, Fatou a étudié plus généralement les mouvements presque circulaires 
dun point matériel dans un champ de gravitation présentant un axe de révolu- 
tion et un plan de symétrie perpendiculaires, et dérivant d’ailleurs d’un potentiel 
newtonien, voisin le plus souvent du potentiel du corps central. Fatou applique 
des méthodes directes dans lesquelles les parties principales des avances du 
neud et du périhélie peuvent se déduire de deux équations différentielles 
linéaires du second ordre, dont les coefficients sont presque constants. 

Je veux ici, dans le but de comparer les deux sortes de résultats, et dévelop- 
pant d’ailleurs trois Notes antérieures,* reproduire ou du moins résumer le 
calcul de Fatou. 


2. Nous déterminons le champ considéré par le potentiel 


F(r, z) = p/t+ R(r, z) 


par unité de masse: » désigne le coefficient attractif du corps central O, r la 
distance d’un point matériel P au point O, z la cote du point P comptée sur 


*Toe. cit., t. 1 (1928), pp. 195-203. 

*Tous les mouvements circulaires et tous les anneaux circulaires que nous con- 
sidérons dans cet Article ont pour centre le corps central. 

* Acta Astronomica, Série a, vol. 2 (1931), pp. 101-163. 

‘Comptes rendus des séances de VAcadémie des Sciences, Paris, t. 203 (1936), pp. 
706, 844 et 981. 
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axe de révolution Oz a partir du point O ou du plan de symétrie I, et les 
deux fonctions F et R sont paires par rapport a la variable z. La fonction RP est 
petite le plus souvent par rapport au terme p/r. Les deux variables r et ; 
conduisent 4 des formules plus simples que les coordonnées semi-polaires. 

Considérées comme fonctions des coordonnées cartésiennes correspondantes, 
F et & sont des fonctions harmoniques, et les équations de Laplace relatives 
a ces fonctions se transforment en les deux équations 


Or? 


PR , 20k 


(1) 


= 


Or? 022 r 


équivalentes bien entendu l’une 4 l’autre; l’équation de Laplace relative 4 la 
différence 
F—R 1 


se réduit de méme a l’identité 4 deux termes connue 


fi 20/1 
+7 


Les deux dérivées 0°F'/0z*, 0°R/dz? ont évidemment la méme valeur, 
dailleurs Fatou a montré que leur valeur commune est négative dans les 


différentes applications. 

Nous étudions des mouvements du point matériel P dont lorbite est 
voisine d’une circonférence de centre O située dans le plan de symétrie Il, 
et qui sont d’ailleurs de sens direct. 

En coordonnées semi-polaires, soient r;, 6 et 2 (r= Vr,2 + 2’), Fatou 
considére dans le mouvement du point matériel P l’intégrale des aires 


(2) (C>0), 


Péquation du mouvement relatif sur la projection du rayon vecteur sur le plan Il 


‘ OF 1 
ou 
(3) 


et enfin |’équation de projection sur Oz 


(4) 
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Kin particulier soit un mouvement circulaire ayant lieu dans le plan de 
symétrie II, et ou p et » désignent le rayon et la vitesse angulaire: intégrale 
des aires (2) se réduit a 


po 


et montre que la vitesse angulaire w est constante; et l’équation (3) se réduit 
a P’équation 


ou 0F'/dp désigne la valeur de la dérivée partielle 0F/dr pour r= p, z= 0. 
Et l’on tire de ces équations 


(5) 


3. Kn premier lieu, dans un mouvement qui tend vers le mouvement 
circulaire précédent, c’est-a-dire ot la constante des aires C' a exactement la 
méme valeur que dans ce mouvement circulaire, et ou le rayon vecteur 7 a des 
valeurs voisines du rayon p, calculons la durée de la révolution draconitique et 
Yavance du neud. Dans le mouvement considéré, au second membre de 
Péquation (4), z est en facteur, puisque la dérivée 0F'/dz s’annule avec z, d’une 
expression voisine de 

10F , 
et par suite de 


pet w étant définis par la constante C et par les équations (5), et — q désignant 
la valeur commune des dérivées secondes 0?F'/02? = 6°R/0z? pour r = p, z= 0: 
q est positif puisque cette valeur commune est négative. Des lors l’équation 
(4) peut s’écrire 


(6) = — q) (1464, 


désignant une quantité trés petite. 

Considérant z comme le seule fonction incomme, on sait que l’équation 
(6) s’intégre par deux quadratures; Fatou étudie ces quadratures, mais, si 
nous nous bornons au terme principal de son calcul, nous pouvous dire que 
Péquation (6) est une équation linéaire en z dont les coefficients sont 4 peu 
prés constants, et dont par suite les solutions sont voisines de fonctions 


périodiques de période Vw? + q. 


OF 

—+—=0, 

dp 
la 

= 
p* 
et 
es 
I, 
1 oF 
— —¢ 
Jp 1» 


796 JEAN CHAZY. 


En particulier la durée de la révolution draconttique, soit T, c’est-a-dire 
Vintervalle de temps compris entre deux instants consécutifs ot la cote z 
s’annule en passant du négatif au positif, tend dans le mouvement considéré 
vers la méme quantité 


L’avance du neud, soit 80, c’est-a-dire la différence entre 2a et l’angle 


lim T, = 


des rayons vecteurs correspondant aux deux instants qui précédent, tend, 
puisque la vitesse angulaire tend elle-méme vers », vers la valeur 


lim a0 = —1). 
+ 
La fonction g étant positive en pratique, la valeur algébrique 60 est négative: 
en fait, dans le sens du mouvement, le neud a, non une avance, mais un retard. 


4. En second lieu, concernant le terme principal de l’avance du périhélie, 
le calcul de Fatou peut se réduire 4 ceci. Soit un mouvement qui tend vers 
le mouvement circulaire considéré dans le plan II, mais ayant lieu lui-méme 
dans ce plan. L’équation du mouvement relatif sur le rayon vecteur, soit 


peut, comme précédemment l’équation de projection sur Oz, étre considérée 
comme une équation différentielle linéaire dont les coefficients sont 4 peu pres 
constants. En effet le second membre s’annule pour r = p, d’aprés la définition 
de p et parce que la constante C a la méme valeur que dans le mouvement 
circulaire limite, et d’autre part il a pour dérivée pour r= p 

con 


quantité que nous supposons négative: donc l’équation peut s’écrire 


0? 


e, désignant une quantité trés petite. Donc les solutions de la derniére équa- 
tion sont voisines de foncitons périodiques dont la périod est la quantité 
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En particulier la durée de la révolution anomalistique, soit Ts, c’est-a-dire 
Yintervalle de temps compris entre deux instants consécutifs ot le rayon vecteur 
rest minimum, tend vers la méme quantité. On peut d’ailleurs transformer 
cette quantité, et écrire, compte tenu des équations (5) et de l’équation de 
Laplace mise sous la forme (1), 


eF 20F 


—q désignant comme ci-dessus la valeur de la dérivée 0°F'/02? pour r = p, 
z= 0. La révolution a done pour durée limite 


lim 
V 
et Pavance du périhélie, soit 8m, c’est-a-dire la différence entre 2a et langle 


de deux rayons vecteurs minima consécutifs, tend vers la valeur 


lim 6@ = 2a 1) 
Vw? — 


qétant possitif, la valeur algébrique d@ est positive aussi: tandis que le nwud a, 
dans le sens du mouvement, un retard, le périthélie au contraire a effectivement 


ume avance. 


5. Nous allous montrer encore que dans les deux cas les résultats obtenus 
peuvent étre déduits de la méthode classique de détermination des petits 
mouvements d’un systéme matériel dont la position dépend d’un nombre fini 
de paramétres ; et cette méthode donne méme une extension du résultat obtenu 
dans le second cas. 

En effet, dans le mouvement du point P, fixons encore la constante des 
aires C, éliminons l’angle polaire 6 au moyen de l’intégrale des aires (2), et 
considérons dans le plan passant 4 chaque instant par l’axe Oz et par la position 
du point P, le mouvement relatif de ce point défini par le rayon vecteur r et 
la cote z, ou par les coordonnées cartésiennes 1, et 2. 

La force relative dans ce mouvement est a priori la résultante de la force 
primitive, qui dérive de la fonction F’, de la force centrifuge et de la force 
centrifuge composée correspondantes. Mais la force centrifuge est, au fac- 
teur — 2 prés, égale au produit vectoriel de la rotation instantanée du plan 
passant par Oz et par le point P, rotation dirigée suivant Oz, et de la vitesse 
lative du point P, vitesse située dans le méme plan, qui est le plan du mouve- 
ment relatif que nous étudions. Donec la force centrifuge composée est per- 
pendiculaire 4 ce plan, et par suite n’intervient pas dans le mouvement relatif 


2 


Op? p Op 
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considéré, en particulier y a un travail nul dans un déplacement virtue 
quelconque. 

La force centrifuge n’intervient que par sa projection sur le plan du 
mouvement: cette projection se déduit de l’accélération normale du _ point 
coincidant, c’est la force perpendiculaire a Oz, et de valeur algebrique 6’*r, dans 
le sens ot l’on s’éloigne de Oz; son travail dans un déplacement virtuel du 
point P, de composantes 67, 6z, a pour expression 


‘ C2 C? Cc? 
(- =) — 2”) | 


La force relative dérive ainsi, en fonction des variables r et z, de la fonction 


C? 
U=- 


On peut dire aussi que l’équation des forces vives du mouvement primitif, 
soit 


+ + == 2F + 2h, 


h désignant la constante des forces vives, se transforme dans le mouvement 
relatif ou l’on a 


1 2? 
— 
en l’équation 


de sorte que le travail des deux forces centrifuges dans un mouvement relatif 
réel quelconque, est égal nécessairement, 4 une constante additive prés, a la 
quantité — [C?/2(1r? — 2?) |. 

La valeur z = 0, une valeur commune p du rayon vecteur r et de la dis- 
tance r, 4 l’axe Oz, et la vitesse » donnée par l’équation p*o = C, déterminent 
un mouvement circulaire du point P, si ces valeurs correspondent dans le 
mouvement relatif considéré 4 une position d’équilibre relatif. Or la dérivé 


4 oF 
Oz (7? — Oz 


est nulle identiquement avec z, puisque F est fonction paire de z: donc les 
valeurs p, et z= 0 doivent annuler l’expression de la dérivée 


le 
va 


sel 


| 
et 
ca 
que 
det 
sta 
(7) 
diff 
équ 
cha 


AVANCES DU NOEUD ET DU PERIHELIE. 


oF 
Or’ 
soit 
12 By 
+ 
p p p 
équations équivalentes aux équations (5). 

Pour étudier les mouvements du point P voisins du mouvement circulaire 
correspondant a cet équilibre relatif, nous sommes ramenés a étudier parmi 
les mouvements relatifs considérés les petits mouvements voisins de la position 
Véquilibre obtenue. Appliquons la méthode classique avec r—p et z comme 
variables indépendantes. 

La force vive relative est 

— 22’)? 


— 22 


et se réduit a 1”* + 2” si lon fait r = p, z = 0 dans les coefficients de 1””, 1’2’, 


7*. De méme, dans la fonction de force réduite, le terme rectangle est nul, 


car la dérivée 0°?U /drdz est nulle a cause de la parité de F' et de U; la dérivée 
?U/dz* se réduit a la quantité négative 


selon la notation antérieure; enfin la dérivée 0?U /dr? se réduit a4 la quantité 


= — bo? + = — + 

p Op” 
que nous transformons comme dans le calcul antérieur, et il suffit que cette 
derniére quantité soit négative pour que la position d’équilibre considérée soit 
stable. La fonction de forces réduite est ainsi, aprés multiplication par 2, 


La méthode classique donne alors les deux équations 


(7) § = — (wo? —q)(r—p), 
) == — + q)z. 


Nous nous trouvons donc dans le cas simple ou, au lieu de former un systéme 
différentiel linéaire d’ordre 4, les équations des petits mouvements sont deux 
équations différentielles linéaires du second ordre déterminant séparément 
chacune une variable. Les périodes fondamentales sont les deux quantités 
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Qa 


et — 
Vo? —q Vor +49 
considérées précédemment. Donc, selon la proposition générale, dans les 
mouvements du point P voisins de mouvements circulaires, le rayon vecteur r 
et la cote z sont représentés par des développements suivant les puissances des 
valeurs initiales 19 — p, Zo, 1”, 2’o. Les termes linéaires de ces développements 
sont donnés par les deux équations linéaires (7%), ce qui met en évidence que 
les durées des révolutions anomalistique et draconitique tendent respectivement, 


quand les valeurs initiales tendent vers zéro, vers les deux quantités 2a/V/ w* —q 


et 2x/Vwo* + q, dot se déduisent immédiatement les limites des avances du 
périhélie et du nceeud. [Remarquons que ces limites de la durée de la révolution 
anomalistique et de l’avance du neeud sont valables dans un mouvement quel- 
conque voisin d’un mouvement circulaire, c’est-a-dire sans supposer comme 
précédemment l’orbite située tout entiere dans le plan de symétrie du champ 
de gravitation ]. 

Les quatre valeurs limites étant obtenues ainsi de fagons trés simples, 
je les formerai dans un autre Article en introduisant les éléments osculateurs 
du mouvement et le systéme différentiel de Lagrange. 
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DIE SYMMETRISCHEN PERIODISCHEN BAHNEN DES 
RESTRINGIERTEN DREIKORPERPROBLEMS IN DER 
NACHBARSCHAFT EINES KRITISCHEN 
KEPLERKREISES. 


Von Ernst HOLpeEr. 


Beim restringierten Dreikérperproblem ergibt sich im folgenden die 
Mannigfaltigkeit der (zur Syzygienachse symmetrischen) periodischen Bahnen 
(K) in der Nachbarschaft eines (“kritischen ”) Keplerkreises K°® mit ganz- 
zahligem Hillschen Periodenquotienten m als Sattelfliche 


(I) p= E(x — + - E(x, €), £(0,0) =b~0. 


Diese wird von der Ebene verschwindenden Massenprozentsatzes » im Punkte 
p= « = € = 0 beriihrt und in den beiden Kurven geschnitten, die den Kepler- 
kreisen € = 0 und den synodisch nach einem Umlauf geschlossenen Kepler- 
ellipsen entsprechen; «x ist der Zuwachs der Jacobischen Konstante von K 
gegeniiber der von K°, € im wesentlichen der (m+ 1)-te Fourierkoeffizient 
in der Kosinusentwicklung des Radiusvektor von K nach den Vielfachen der 
Elongation, ein Parameter, der sich fiir »—0 niherungsweise auf die Ex- 
zentrizitat reduziert ; a= (m/(m-+ 1) )*/ ist der Radius von K°. 

Fixieren wir mit Wintner ‘+ den Massenprozentsatz » > 0, was der Strém- 
grenschen Gruppenfortsetzung entspricht, so ergeben sich auf der Léosungs- 
fliche hyperbelartige Kurven, welche die von Poincaré? betonte analytische 
Fortsetzung der Lisungen einer natiirlichen (isoenergetischen) Schar erster 
Sorte (iiber den kritischen Wert «x = 0 hinaus) in die Lésungen einer natiir- 


1A, Wintner, a) “itber eine Revision der Sortentheorie des restringierten Drei- 
kérperproblems,” Ber. der Sdchs. Akad. d. Wiss. zu Leipzig, Bd. 72 (1930), S. 3-56; 
b) “Sortengenealogie, Hekubakomplex und Gruppenfortsetzung,” Mathematische Zeit- 
schrift, Bd. 34 (1931), S. 350-402. Auf diese Arbeiten stiitze ich mich fortwahrend, 
auch wegen ausfiihrlicherer Literaturangaben sei auf sie verwiesen. Die von mir be- 
nutzte graphische Darstellung der Lésungsmannigfaltigkeit ist freilich eine andere als 
die auf der Massenschranke beruhende Wintnersche. 

2H. Poincaré, “Les solutions périodiques et les planétes du type d’Hécube,” 
Bull. Astr., t. 19 (1902), S. 177-198, insbes. S. 185; die Figuren dort sind nur schematisch 
zu verstehen. Die zweite scheint mir auch nicht in allen Teilen die Tabelle von Brendel, 
“Theorie der Kleinen Planeten 1,” Abh. der Ges. d. Wiss. zu Géttingen, Neue Folge 1}, 


Nr. 2 (1898), S. 128, wiederzugeben. 
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lichen Schar zweiter Sorte vermitteln—Scharen, die nach den Untersuchungen 
von Levi-Civita * bezw. Birkhoff * feststehen. 

Als Gegenstiick werden auch die Verhiltnisse in der Umgebung eines 
Keplerkreises K°® mit rationalzahligem Periodenquotient m= q/J, J > 0, 
(q,/) = 1 wenigstens angedeutet, die sich ahnlich behandeln lassen. Dann 
gehort K° zwei verschiedenen Mannigfaltigkeiten von Lésungen an, welche die 
synodische Windungszahl 1 bezw. J haben, also zum ersten bezw. zweiten 
Poincaréschen Genre gehéren und sich, wenn man wollte, in natiirliche Scharen 
erster bezw. zweiter Sorte zusammenfassen lieBen. Auch in jeder Ebene 
= const. existiert ein Doppelpunkt, durch den zwei verschiedene Losungs- 
zweige hindurchgehen. 

Die aus (I) approximativ folgende Beziehung 


im wesentlichen, hat sehr elegant Tisserand * hergeleitet — durch storungs- 
theoretische Niherungsbetrachtungen, die sich aber wohl nicht so leicht zu 
einer strengen Methode fiir die Gewinnung des vollstindigen analytischen 
Zusammenhangs ausgestalten lieBen. Bei Tisserand ist zundchst nicht von 
periodischen Lésungen die Rede. Erst in einem kurzen Zusatz zur Darstellung 
seines Traité faBt er die urspriinglichen Entwicklungen von Radiusvektor und 
wahrer Linge als Fourrierreihen auf, deren Koeffizienten die in der storenden 
Masse linearen Approximationen der entsprechenden Koeffizienten einer 
periodischen Lésung sind. Diese hatte inzwischen, geleitet durch die Delaunay- 
sche Mondtheorie, Hill ® bemerkt und zugrundegelegt, als er im restringierten 
Dreikérperproblem Saturn—Titan—Hyperion, auf das iibrigens auch Tisserand 
seine Theorie angewandt hatte, die Bewegung des Hyperion und die Titan- 
masse berechnete. Analytisch kénnte die periodische Lésung mit derselben 
Methode der unendlich vielen Koeffizienten ermittelt werden, die Hill bei der 
Variationsbahn des Mondes entwickelt hat. Diese Methode hat neuerdings 
Wintner* mathematisch begriindet und er hat auch die Anregung zu ihrer 


8T, Levi-Civita, “Sopra alcuni criteri di instabilita,” Annali di Matematico (3), 
vol. 5 (1905), pp. 221-308. 

*G. D. Birkhoff, “The restricted problem of three bodies,” Rendiconti Palermo, 
vol. 39 (1915), pp. 265-334. 

SF. Tisserand, a) “Sur un cas remarquable du prebléme des perturbations,” 
Bull. Astr., t. 3 (1886), pp. 425-433; b) Traité de la Mécanique Céleste, t. 4 (1896), 
p. 113 ff., insbes. p. 116 f., sowie p. 491. Diese Untersuchung sowie die folgende von 
Hill hat Wintner, loc. cit. 1), a) hervorgehoben. 

¢G. W. Hill, “The motion of Hyperion and the mass of Titan” (1888), Coll. Math. 
Works, vol. 2 (1906), pp. 135-143. 

7 A. Wintner, a) “Zur Hillschen Theorie der Variation des Mondes,” Mathematische 
Zeitschrift, Bd. 24 (1925), S. 259-265; b) “Uber die Differentialgleichungen der 
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Anwendung auf das Problem der synodisch ganzzahligen Kommensurabilitaéten 
gegeben, die von Martin * ausgefiihrt worden ist—allerdings nicht bis zu einem 
vollstindigen Uberblick iiber die Mannigfaltigkeit der Lésungen. Dazu muss 
man in der Rechnung noch einen Schritt weiter gehen und auch das Glied xé 
in (I) ermitteln. Ich tue das hier auf Grund der Lichtensteinschen ® Theorie 
nichtlinearer Randwertprobleme, die mir? am nichsten liegt und deren ich 
vielleicht auch deshalb hier gedenken darf, weil sie ja eine grosse Analogie mit 
dem von Hill tatsachlich durchgefiihrten Verfahren hat: das Randwertproblem 
der symmetrischen Konjuktion und Opposition ist fiir symmetrische Bahnen 
mit dem der Periodizitaét gleichwertig, und die Berechnung der Stérungen 
mittels mechanischer Quadraturen wird in der analytischen Theorie durch eine 
unter Zuhilfenahme der Greenschen Funktion gebildete Integrodifferential- 
gleichung ersetzt, die nach der Methode der sukzessiven Approximation geldst 
wird. 


1. Unter dem Einflu8B des Newtonschen Gravitationsgesetzes sollen zwei 
Massenpunkte M, (Masse 1—yp, “Sonne”) und M, (Masse p, “ Jupiter ”) 
umeinander Kreise beschreiben; der gemeinsame Radius der Relativbahnen 
sei gleich der Liingeneinheit. Die Einheit der Zeit ¢ sei so gewahlt, daB die 
konstante Winkelgeschwindigkeit der Linie M,M, gleich Kins ist. 

In der Ebene der beiden Relativbahnen bewege sich ein Aufpunkt P = Py 
(“kleiner Planet”), welcher der Gravitationswirkung von M, und M, aus- 


Himmelsmechanik,” Mathematische Annalen, Bd. 96 (1926), S. 284-312; c) “ Uber die 
Existenz der Hillschen Mondbahn of maximum Junation und der Poincaréschen Schling- 
bahnen,” Mathematische Zeitschrift, Bd. 28 (1928), S. 430-450; d) “Uber die Kon- 
vergenzfragen der Mondtheorie,” Mathematische Zeitschrift, Bd. 30 (1929), S. 211-227. 

*M. Martin, “Upon the existence and non-existence of isoenergetic periodic per- 
turbations of the undisturbed circular motions in the restricted problem of three bodies,” 
American Journal of Mathematics, vol. 538 (1931), pp. 259-273. 

°L. Lichtenstein, ‘“ Untersuchungen iiber die Gestalt der Himmelskérper, IV. Abh. 
Zur Maxwellschen Theorie der Saturnringe,” Mathematische Zeitschrift, Bd. 17 (1923), 
§. 62-110; “ V. Abh. Neue Beitrage zur Maxwellschen Theorie der Saturnringe,” Seeliger- 
Festschrift, S. 200-225 (Berlin, 1925). 

Vol. KE. Holder, a) “Mathematische Untersuchungen zur Himmelsmechanik,” 
Mathematische Zeitschrift, Bd. 31 (1929), S. 197-256, insbes. S. 225-239. Eine Anwen- 
dung der Methode auf das Problem der synodisch ganzzahligen Kommensurabilititen 
enthilt meine Note b) “ Die Verzweigungsgleichungen fiir die kritischen Kreise des 
restringierten Dreikérperproblems,” Ber. der Sdchs. Akad. der Wiss., Bd. 83 (1931), 
8. 179-184. Aus dem dort gezeigten ausnahmslosen Nichtverschwinden des Koeffizienten 
von # in der Verzweigungsgleichung folgerte ich damals mit Martin die Nichtexistenz 
einer von K° ausgehenden natiirlichen Schar erster Sorte. Dieser Schluss muss aber 
ausdrticklich auf die von Hrn. Martin betrachteten Scharen eingeschriinkt werden. Das 
Vollstindige Ergebnis wird erst hier gegeben und liefert eine positive Massenschranke 
auch fiir «= 0, 0. 
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gesetzt ist, ohne diese Korper zu stéren. Als Koordinaten von P nehmen 
wir die kartesischen Koordinaten z, y in dem rotierenden Achsenkreuz, dessen 
Ursprung in M, liegt und dessen z-Achse nach M, hin gerichtet ist. 

Die Lagrangesche Funktion dieses restringierten Dreikorperproblems ist 
(1) A=A(z,y,%,¥;4) =T+f+ U, 
wo 
(2) 
und der Punkt die zeitliche Ableitung bezeichnet. 

Da die Zeit ¢ in A nicht explizit vorkommt, gilt das Jacobische Integral 
(4) T—U=k. 

2. Man gewinnt nun die zur Jacobischen Konstanten k gehorigen Bahn- 


kurven der Gestalt nach als Extremalen des geometrischen Variationsproblems 
8 fA*dr = 0 mit der Maupertuisschen Wirkungsfunktion 


(5) A* = [2(U +k) + f*, 


wobei 7'*, f* aus 7’, f entstehen, wenn man die Geschwindigkeitskomponenten 
z, y durch die Ableitungen 2’, y’ nach dem Parameter + ersetzt. Dieser ist 
aus Homogenititsgriinden willkiirlich und soll als der durch 


(6) C08 7, y=rsint 
definierte Polarwinkel + (“Elongation”) genommen werden, der also jetzt 
unabhiingige Variable ist, wihrend die gesuchte Funktion der Radiusvektor 
r==r(r) ist. Es wird dann 
(7) A* = [2(U + k) (7? + 1°) 4 2°. 

Der zeitliche Ablauf r= 7(t) der Bewegung, der uns hier nicht weiter 


interessiert, bestimmt sich wegen (4) hinterher durch eine Quadratur aus 


2(U +k) 


(8) di 


8. Wir untersuchen hier die Bahnen K, fiir die der Radiusvektor eine 
in + mod 2z periodische Funktion r(r) ist und die in der Nachbarschaft der 
zum Zweikérperproblem gehdrigen Keplerschen Kreisbahn 


(9) K®: g=acosr, y=asinr 


von P, liegen. Die Bahn K° hat die Jacobische Konstante, vgl. (51), 
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(10) ko = — (2a) — ab 


und die mittlere Bewegung n mit n?a* = 1. 
Der Charakter der gesuchten Losungsmannigfaltigkeit (K) hangt wesent- 
lich davon ab, ob fiir diese Ausgangslésung K° der Hillsche Periodenquotient 


(11) m=1/(n—1) 


eine ganze Zahl ist (singulirer, kritischer Fall) oder nicht (regularer Fall). 
Die mittlere Bewegung ist jetzt n 1+ 1/m. 
Ubrigens sind aus dynamischen Griinden die Werte 


(12) m=— 1, 0, 0, 


auszuschlieBen ; sie entsprechen bezw. den Grenzfallen, daB K° ins Unendliche 
riickt, in die Sonne hineinfallt, mit der siderisch direkten oder retrograden 
Jupiterbahn identisch ist. 

Hine eingehendere Betrachtung bedarf nur der singulare Fall; er kann 
nur fiir einen siderisch direkten Planeten eintreten. Denn die oberen und 


unteren retrograden Kreise gehéren zu 
(13) 
konnen also niemals kritisch werden. 


4. Fiir die zu untersuchenden Bahnen (K) sei der Radiusvektor 
(14) r= a(1 + ¢) 


vesetzt; £ = £(7) mul die aus der Grundfunktion A*, Formel (7), abgeleitete 
Lagrangesche Gleichung fiir £ befriedigen. Daf K in einer Nachbarschaft 
von K° liegt, bedeutet, daB fiir K der Massenprozentsatz » und die Differenz 
«== k —k°® der Jacobischen Konstanten absolut genommen kleine Werte be- 
sitzen und daf auch £(r) und seine Ableitung ¢’(7) dem Betrag nach unter- 
halb gewisser Schranken bleiben. Wir entwickeln am besten gleich A*, wo 
auch U = U(r,£;,«) aufzufassen ist, nach Potenzen von p, x, €, ¢’. Dies 
wird fiir jeden einzelnen Wert von a gemacht. 
Zum Zwecke der Entwicklung schreiben wir ausfiihrlich 


(15) +h) =a2(1+ £)?— 2ua(14+ £) cose + p? — (204+ a") + 
+ 2(1—p)a*(1 + 
+ 2p[a?(1 + ¢)? — 2a(1 + ¢) cost + 1]4 


und bekommen fiir die Lagrangesche Funktion den Ausdruck 
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(16), A*—=a[(V + + (14+ 24+ 0%) 
mit 

(17) V=at(1+£)? + 4 £)7— (2a 4 a), 

(18) W— [d? + 2a(a— cos + —a(1 + £) cosr—a4(14 
hier ist 

(19) d? =1—2acost+a? 

und, was sofort benutzt wird, (11) zufolge 

(20) a*=(m+1)?m~. 


In der Entwicklung von V und W brauchen wir, wie sich herausstellen 
wird, die Anfangsterme bis zum dritten bezw. ersten Grad in ¢: 


(21) (a—at)? 4 2(a? — 0 + (a? — 

= a’m{1 + 2(1—a*) + (1 2a*) — 

a?m-*{1 —2(2m + (3m? + 4m + 2)e2—2(m + 
und 


(22) W,—dt—a*—acosr-+ (a' + (cos r—a)ad* —a cos r)é. 


Wir erhalten daher 


= — + (1+ 6)? 


Dabei ist 


(24) Vet am*{1—(2m 4 
+}, 


(25) + (2m + 1)£+4+ 4 (9m? 4+ 8m +4 
V+ ist der Abschnitt davon bis zu dem Glied mit £ und 


(26) = —acosr-+ [a* + ad*(cos r— a) —acost 
+ (2m +1) 


A®, A®) und nachher A‘) enthalten auGer einigen fiir die Lagrangesche 
Gleichung belanglosen, von ¢, ¢’ unabhiingigen Gliedern solche Glieder dritter 
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Ordnung in x, p, , ¢’, die den Faktor » enthalten, ferner Glieder vierter und 
héherer Ordnung. Schlieflich wird 


(27) a-*mA* = 362 —4(m + — (m+ 1)? — m(m + 
+ a-*m?{2(m + 1) &« — a-*m? (38m + 2) Sx? 
+ 9% (m +1) (3m + 1) 
+ [2(m+1)d*— (2m-+ 3)acosr 
+- ad-* cos r — a?d-* — (2m + 1)a*]fu} + 
= — 3(m + + 
abgekiirzt. 


5. Die Lagrangesche Gleichung ist 


= — S{k(1l —1) + — 1) 
+ (2Biosoxg — + Borrop + Wl) 


Hier enthalt ¥“) die Glieder zweiter Ordnung, soweit sie » als Faktor haben, 
sowie die Glieder dritter und hdherer Ordnung. 

Die Ermittlung der in + mod 2z periodischen Losungen der Lagrangeschen 
Differentialgleichung (28) fiihrt im reguliren Falle m~1, + 2,- - - mittels 
der Greenschen Funktion von Z(é) zu einer nichtlinearen Integrodifferential- 
gleichung, deren Lisung £ = {(7;x,») sich fiir kleine Betrige von x, » durch 
sukzessive Approximationen ergibt und auch die Lagrangesche Gleichung (28) 
erfiillt und periodisch ist. 

Im kritischen Fall m = 1, + 2, + 3,:-- hat L(Z) = 0 aber die periodischen 
Nullésungen Z = cos pr, sin pr mit p= m + 1, und es muf mittels der verall- 
gemeinerten Greenschen Funktion G (7,74) zur nichtlinearen Integrodifferential- 
gleichung 


(29) f(r) = pr + 7sin pr + G(r, 7) Vx dry, 
70 
mit 
(30) {cos pr dr, {sin pr dr 
0 0 


ibergegangen werden. 


6. Sind p,« und auch é,7 beliebig vorgegeben in einem gewissen vier- 
dimensionalen Interval Q um den Nullpunkt »=« === 0, so ergibt 
sich durch sukzessive Approximationen genau eine Lésung = p, x, &, 7) 
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von (29), die sich in eine Potenzreihe nach x, p, é, 7 entwickeln laBt. Damit 
die gewonnene Funktion £, die in + mod 2z periodisch ist und die vorgegebenen 
Werte é,7 als Parameter im Sinne von (30) hat, aber auch eine Losung der 
urspriinglichen Lagrangeschen Differentialgleichung (28) ist, muB ¢ noch die 
sog. Verzweigungsgleichungen 


(31) R= cos prdr=0, S = (a/m)? sin pr dr = (0) 
befriedigen. 

Schreiben wir in { die Glieder erster Ordnung, die aus der ersten Ap- 
proximation entnommen werden kénnen, aus, so kommt 


(32) + But 


der Koeffizient 8 von p ist fiir das folgende belanglos. 
Zuerst berechnen wir 


(33) k= {[12Booso + 4(m + 1)? Boorz | (m + a 
+ (a/m)?[2Byo20 + 2(m + 1)? + p+ RY 
— {—12m(m +1) + (—4-+1) (m1)? 4 3(m +1) (3m +1) 
+ + RO, 


also 
(34) R = — 6m(m + 1)KE + 2BOMM + RM, 
wo 
(35) 2Bim1) (a/m)? 7), COS (m + dr 


und R) sowie nachher S“) Glieder zweiter Ordnung mit dem Faktor yp sowie 
Glieder dritter und hdherer Ordnung enthalt. 
Die analoge Rechnung gibt 


(36) S =— 6m(m+ 
der Koeffizient von » verschwindet hier offenbar aus Symmetriegriinden. 


7. Vor allem benétigen wir das Nichtverschwinden des Koeffizienten 
von p, 2B) = am-*- 2A) in der Bezeichnung meiner friiheren Arbeit 5), 
in der ich den diesbeziiglichen Nachweis allgemein erbracht habe. 

Man fiihrt dazu die Laplaceschen Koeffizienten A,” ein, die durch die 
Fourierentwicklung 


(37) (1 — 2a cos + @?)-§ == + 23 Ag? cos pr, Ag? Ag? 
p=1 

(0<a< 1, s reell) 
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definiert sind. Sie kénnen bekanntlich mit Hilfe der Gaufschen hypergeo- 
metrischen Reihe dargestellt werden 


2 


Fir uns ist vor allem wichtig, daB fiir 0 <s simtliche Glieder der Reihe 
positiv sind. 
Wir setzen jetzt s = 4 und unterscheiden: 


1, Fall eines direkten unteren Planeten ( >": ). Hier gilt mit «=a 


(39) 2B) — — 20°A?,,, + (AP + AP), 


8+1 
Die As? genitigen nun bekanntlich den Identitaten 
(40) = Ap? — (1+ @) + (A +A 


d 2 
(l1—«@ ) AP eat. 


Aus (89), (40), ergibt sich durch Subtraktion 
(41) 2B) = (4p — 1)A?,/2 +- (1 — a?) APs BY) — /2 => 0. 
2, Fall eines direkten oberen Planeten Dann ist mit «= 1/a 


(42) = 4paA — a? + A ) ; 


&+1 


dies mit a multipliziert und davon (40), subtrahiert, gibt 


(43) 2aB?) (4p — 1) A?1/2 (1 a?) APs jo, 


aB) — (2p — 1) A? < 0. 


(In der Arbeit 6) hiess iibrigens in diesem 2. Fall aB® einfach B®). 


8. Fiir »—0 haben die Keplerschen Kreise verschwindende Parameter 
§,. Die eigentlichen Keplerellipsen miissen die feste grosse Halbachse a, 
gegeben durch (20). mit m1, + 2,- - -, haben, um im synodischen Achsen- 
kreuz 2, y die Winkel-Periode 2x zu haben. Was die iibrigen Elemente betrifft, 
8 nehmen wir als Epoche t =0 den Zeitmoment, wo die Bahn von Py die 
positive Syzygienachse kreuzt. Von dieser, als X,-Achse siderisch fixiert, 
tihlen wir (alles siderisch) die mittlere Lange A = nt, die wahre Lange v und 
die Perihellange @ ; die mittlere Lange der Epoche ist e = 0. Die Exzentrizitat 
é sowie | aw | oder | ~—v7| seien klein vorausgesetzt. Sei noch die exzen- 
trische Anomalie wu, die Flaichenkonstante c= na?(1— e?)4, so gelten die 
Bezichungen 


= 
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(44) A—@W=u—esin u, r—a(1—ecosu), dv =cr*dt = (c/n) rd, 
Somit gelten fiir Radiusvektor und wahre Lange die Potenzreihen nach e: 
(45) v=A+ 2sin 


Die Ellipsen, fiir die | w | bezw. | a —- | klein sind, bilden eine zusammen- 
hangende stetige Schar, die man erhalt, wenn man | w | klein und —1 <¢ 
< +1 nimmt. 

Fihren wir jetzt den von der Syzygienachse gezihlten Polarwinkel ein: 


(46) nm") 4+ 2sin (A— DW) 
(47) pr=(m+1)r=A+ 2pesin (A—w)+°°-, 

so ist umgekehrt 

(48) LA=pr—2pesin 

(49) cos (A— @) = cos (pr @) + 2pe sin? 


Nach (45), wird jetzt fiir eine Keplerellipse die durch (14) definierte Grosse 
£=—{(rle,@) eine Potenzreihe in e,@ deren Koeffizienten (sogar schon 
mod 2x/p) periodische Funktionen von 7 sind. Unsere Parameter werden 


(50) 


Es ist daher auch £ = {(r|é, @) eine Potenzreihe in é, w. 
Fiir die betrachteten Keplerellipsen ist die Jacobische Konstante die 
Differenz von absoluter Energiekonstante h und Flachenkonstante c 


Ks ist daher 

(52) «—=— (¢— =— col (1— = feof? 
eine Potenzreihe in é, w, die der Gleichung 

genigt. 


9. Bei der Diskussion der Verzweigungsgleichungen (31) muf ich mich— 
jedenfalls im Moment—auf die Bahnen beschranken, die zur Syzygienachse 
symmetrisch sind: £(—7+) —€(r). (Fiir lassen wir in den Potenz 
reihen £,x, P der vorigen Nr. das Argument w = 0 einfach weg). Es ist dann 
von vornherein 7 = 0 zu setzen, und dies ist auch hinreichend dafiir, daf die 
Lésung der Integrodifferentialgleichung (29) die Symmetrie bzgl. der Sygy- 
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gienachse hat. Die zweite Verzweigungsgleichung § —0 ist dann identisch 
in p, «x, € erfiillt; es muB also S den Parameter y als Faktor enthalten. 

Wir haben nur noch die Bedingung = 0 fiir die Parameter p, x, 
mu studieren. Da B‘'?) 40 ist, bestimmt sich der Massenprozentsatz als 
Potenzreihe in x, é 


(54) p= f(x, £) b = 38m(m + 0. 


Nun beachte man: die Potenzreihe £(7|€) in € zusammen mit der Potenz- 
relhe (52) «—x«(é) und »=0O in den Ausdruck RF eingesetzt, gibt eine 
Potenzreihe in €, die identisch verschwindet. Auf der Lésungsfliche (54) im 
Raum der Koordinaten x, é, » liegen also jedenfalls die Punkte der parabelar- 
tigen Kurve (53) P(x,€) =0 in der Ebene » = 0, ferner, wie man ebenso 
sieht, die Punkte der Geraden 0, »=0O. Auf jeder der beiden im 
Ursprung irreduziblen algebroiden Kurven muf also sicher f(x, é) sein. 
Daher gilt mit einer Potenzreihe H(x,é) die Teilbarkeit ** 


(55) f(x, €) = B= E(x — -) B(x, B(0,0) 


Der Punkt «x = = 0 ist ein Sattelpunkt der Losungsfliche (54). Die 
Kurven » = const. auf der Losungsfliche haben in der Nahe von «x —£=0 
hyperbelartigen Charakter.? 

Betrachtet man (etwa unter der Voraussetzung m —1,2,---) die zum 
festen Wert x, <0 gehdrige “natiirliche ” Losungsschar 3, (erster Sorte) 
sowie die zum festen Wert x.> 0 gehdrige natiirliche Lésungsschar >. (zweiter 
Sorte mit € < 0)—natiirlich nur innerhalb des Quaders Q —, so kann man 
wohl sagen, dafs jeder Loésung von 3, durch Gruppenfortsetzung auf der 
Lésungsfliche (also bei festem » > 0) eine Lésung von &, entspricht. Analog 
sind die Verhiltnisse fiir positiven Exzentrizitatsparameter €, dem bei » = 0 
die Perihellinge @ — = entspricht. Nur sind jetzt beide «:, x2 > 0, sie kénnen 
also auch gleich sein. 

Auch die zum festen Wert x = 0 gehorige natiirliche Schar existiert. 
Ks ist dann 


(56) po — +---, 
also 


Diese vom kritischen Kreis ausgehende natiirliche lineare Reihe 3% von 


Vel. Osgood, Lehrbuch der Funktionentheorie I1,, Leipzig-Berlin 1924, p. 87, 2. 
Zusatz. Die entsprechende algebraische Tatsache griindet auf einem von meinem Vater 
hewiesenen Satz: O. Holder, “ Zum Invariantenbegriff,” Math.-naturw. Mitt. Wiirtt., Bd. 
1 (1884), S. 60-62. 
72-Vgl. Hadamard, Cours d’analyse I, p. 367 f., Paris, 1927. 
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periodischen Losungen entgeht der Betrachtungsweise von Martin, der p = z* 
schreibt und, Joc. cit., p. 267, 
(58) f= O(2*) = O(u*"*) 


voraussetzt. 


10. Auf der Losungsmannigfaltigkeit liegt, wie noch gezeigt werden soll, 
die vom kritischen Kreis K°® ausgehende isoperiodische Schar (der Lésungen 
von der synodischen zeitlichen Periode To = 2rm) in der Ebene p= 0, sie 
besteht also nur aus den Keplerellipsen vom Periodenquotient m.** 

Fiir die Lésungen der Integrodifferentialgleichung (29) ist die zeitliche 
Periode, durch (8) definiert, 


Setzen wir hier (54) zufolge p= b«é+--- ein, so ergibt sich (auf der 
Lésungsfliche) 
(60) T = ). 
Es geht nun 

OT , OT Op oT 
und es kann bei der an der Stelle »—«—£—0O zu nehmenden Ableitung 
rechts vor der Differentiation » 0 gesetzt werden, d.h. man berechnet 
den Limes lings der von « abhingigen Lésungsschar p= von (29), 


18 Vgl. Wintner, loc. cit., 1, a) S. 49. 
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die offenbar die Schar der zu » 0 gehérigen Keplerschen Kreise mit den 
Halbachsen a(1-+ y) sein In dieser Schar ist 


(62) T= + = T,(1— — 1) 7 
=T) (1+ %(m+1)y+: 

und 

also fiir «x = 0 

ar 

Ox 

Da sich in (61) nunmehr der Limes rechts von Null verschieden ergeben hat, 

geht durch x = € = 0 genau ein linearer regulirer Zweig x = «(é) der Kurve 

E(x,é) = T, hindurch. Dieser muB notwendig (52) «= + +, 0 


(64) = 3T,(m + 1)ma-* = 6rm*a* 0. 


sein. 

11. Nur andeuten kann ich die Rechnungen, die auszufiihren waren, 
um in der Nachbarschaft des Keplerkreises K° der mittleren Bewegung 
(65) n=p/q, p,qganz,(p,q)=—1, 


dieser p — gq —J mal syndosch durchlaufen, die (zur Syzygienachse sym- 
metrischen) periodischen Bahnen zu iibersehen, die sich (mindestens) nach 
dem synodischen Umlauf 27J schlieBen. Es ist dann der Periodenquotient 


(66) m=q/(p—q) =q/J und m+1—p/, ((p,J) =1). 


Multiplizieren wir die Differentialgleichung (28) mit J? und setzen 


(67) r/J = 8, 
80 kommt 


Davon sind die im Intervall 0 = 8X periodischen Losungen zu suchen. 
Das zugehérige Intervall 0 7S heift das primitive Intervall. 

Was die Entwicklung der Lésung ¢ der zu (68) gehdrigen Integro- 
differentialgleichung anlangt, so andert sich, da ja die Differentialgleichung 
dieselbe geblieben ist, an dem konstanten Glied erster Ordnung gar nichts. 
Wir haben 


= 2a-*m?(m +1) + Bu + Ecosps + ysin pd+-: - 

= 2a-?m?(m + 1) + Bu + écos (m+ 1)r+ (wm 
nach wie vor, wo jetzt 
3 


(69) 


v). 
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(70) 0 
€sin pb dé = {sin (m + 1)rdr 
0 0 


und £ ein wieder nicht in Betracht kommender Koeffizient ist. 
Setzen wir von vornherein Symmetrie beziiglich der Syzygienachse, d. h. 
7 = 0 voraus, so brauchen wir nur die eine Verzweigungsgleichung 


2nJ 


(71) R= (a/m)** cos pd di = (a/m)*xtJ Yecos (m+ 1)rdr =0 
0 0 


zu betrachten. Darin berechnet sich der Koeffizient von xé in derselben Weise 
wie friiher fiir / nur hat man zum Schlué das Integral 


2nd 21 
(72) cos? (m f cos* pb d§ = Jn. 
0 


Kin Glied mit » allein tritt in (71) fir J >1 nicht auf. Der Koeffizient 
= Von » in Ja6t sich namlich in eine Kosinusreihe nach 
7+ =J6 entwickeln; ein Vielfaches dieses Winkels kann aber niemals gleich 
dem Winkel pé sein, mit dessen Kosinus ¥ in (71) zu falten ist. Denn daf 
ein Vielfaches NJ = p wird, ist bei der Teilerfremdheit von p und J nur fiir 


J =1 moglich. Im ganzen bekommen wir fiir (71) 
(73) R= J*-6m(m +1)KE+- -=E(6pge =0. 


Dazu muf man noch beachten, daB 0 gelten sowie 0 ist. Denn 
fiir hat die zu 07S gehorige Integrodifferentialgleichung eine 
eindeutig bestimmte, in x, regulire analytische Losungsschar. Diese mu 
man bekommen, wenn man von der zu 0 = 7 S 2a gehorigen Integrodifferen- 
tialgleichung ausgeht; deren Loésungen sind ja, da der regulire Fall vorliegt, 
ohne weiteres Lésungen der Lagrangeschen Gleichung fiir die Bahnkurven 
und haben als Kosinusreihen in + wieder wegen der Teilerfremdheit von p,J 
vermoge (70) den Parameter geniigen also der zu 
gehorigen Integrodifferentialgleichung mit 0. 

Die Verzweigungsleichung (73) zeigt, daB die Mannigfaltigkeit der 
symmetrischen Losungen zerfallt ; auch bei festem erhalten wir zwei Lésungs- 
gruppen, entsprechend Bahnen mit der synodischen Windungszahl 1 bezw. J. 

Weitere Rechnungen bediirfte die Bestitigung der Vermutung, daf die 
Zeitspanne JT, = = selbst als primitive synodische Periode von 
Bahnen mit » > 0 nicht vorkommt. 
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SYMBOLIC DYNAMICS. 


By Marston Morse and Gustav A. HEDLUND. 


1. Introduction. In both its qualitative and quantitative aspects, modern 
theoretical dynamics is largely concerned with recurrence and transitivity. 
The qualitative study of dynamics owes its inception to Hill and Poincaré. 
Its development in the past three decades has been dominated by the work of 
Birkhoff, to whom belongs the principal credit for the trend of modern 
research in dynamics. The quantitative (metrical) study of dynamical sys- 
tems is of more recent origin, but with the establishment of the ergodic 
theorems of Birkhoff and von Neumann, its development has been rapid. 

The methods used in the study of recurrence and transitivity frequently 
combine classical differential analysis with a more abstract symbolic analysis. 
This involves a characterization of the ordinary dynamical trajectory by an 
unending sequence of symbols termed a symbolic trajectory such that the 
properties of recurrence and transitivity of the dynamical trajectory are re- 
flected in analogous properties of its symbolic trajectory. For general research 
of this character one may refer to papers by Hadamard, Birkhoff and the 
authors. Papers which are special in that they refer to spaces of constant 
curvature, but which are significant by virtue of the symbolisms involved have 
been written by Artin, Koebe, Martin, Myrberg and Nielsen. Martin and 
tobbins have contributed papers on purely symbolic aspects. 

In the general theory as developed up to the present, the dividing line 
between symbolic and differential methods has not been sharply defined. It is 
a first aim of the present paper to separate the symbolic aspects from the 
differential aspects and to develop the symbolic theory on its own account. 
In this way many interesting and important dynamical questions can be given 
an algebraic formulation quite apart from the classical theories of differential 
equations. 

A symbolic trajectory 7’ is formed from symbols taken from a finite set 
of generating symbols subject to rules of admissibility which are dependent 
on the underlying Poincaré fundamental group. The trajectory 7 taken with 
one of its symbols is called a symbolic element. Symbolic elements are ana- 
logues of line elements on an ordinary trajectory. A metric is assigned to the 
space of symbolic elements and this space turns out to be perfect, compact, and 
totally disconnected. The necessary first theorems on symbolic recurrence and 
transitivity are then rapidly developed in a way suggestive of the differential 


treatment of Birkhoff. 
815 
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We come to the more novel developments of the paper. Recall that the 
recurrency index k(n) of 7 is a minimum number m such that each block 
of m successive symbols of 7’ contains every block of n successive symbols of 7. 
If K(n) exists for each n, T is termed recurrent (cf. Morse [2], p. 94). In 
§ 7 we are concerned with conditions on R(n). <A typical condition is that 
R(n) > 2n for every non-periodic recurrent T. In § 8 we make a special study 
of the Morse recurrent trajectory (Morse [2], p. 95) obtaining R(n) explicitly. 
This is the first and only known evaluation of R(n) for a non-periodic re- 
current trajectory. We give a definition of almost periodicity of symbolic 
trajectories somewhat similar to the ordinary definition of Besicovitch ([1], 
p. 77) and show that the Morse trajectory is not almost periodic. Most of 
the known recurrent trajectories are almost periodic. In § 9 we give general 
methods for deriving new recurrent trajectories from a given recurrent tra- 
jectory. For the Morse trajectory, #(n)/n is shown to have an inferior limit 
of 5.5 and a superior limit of 10. In § 10 we define a recurrent trajectory 
for which the inferior limit of R(n) /n is finite while the superior limit is + «. 

We continue with a study of transitivity. An admissible symbolic sequence 
of the form a,a,a;:-- is termed a ray. We assign the Baire metric (cf. 
Hausdorff [1], p. 101) to the space of rays. Let 6(n) be the minimum length 
of a block which contains a copy of each admissible n-block. A ray R which 
contains a copy of each admissible block is termed transitive. If FR is transitive, 
let (7) be the minimum length of an initial block cf H which contains a copy 
of each admissible n-block. We generalize a theorem of Martin [1] and in- 
clude a hitherto unpublished theorem of R. Oldenburger on the impossibility 
that ¢(n) = 6(n) for two successive values of n (n >1). A typical general 
theorem states that the set of transitive rays for which 

lim. inf. | 
(n) 
is a residual set of the second category. 

Symbolic dynamics as the authors conceive it forms one of the three 

divisions 
(1) representation theory, 
(2) symbolic dynamics, 
(3) existence of space forms, 


of the whole theory. The representation theory is concerned with the col- 
ditions on space forms under which trajectories admit a one-to-one symbolic 
representation in terms of which the recurrence or transitivity of the trajectory 
can be determined. These conditions will involve the Poincaré fundamental 
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group of the space and differential conditions such as that of uniform instability 
(cf. Morse [4], p. 64). In (3) one is concerned with the existence of space 
forms satisfying the conditions discovered in (1). The questions involved are 
rather deep extensions of the Hilbert, Koebe theory of spaces of negative curva- 
ture (cf. Hilbert [1], and Koebe [1]). A simple typical theorem is that there 
exists no two-dimensional Riemannian manifold of the topological type of the 
torus satisfying the condition of uniform geodesic instability. The bearing of 
such studies on questions of topological and metric transitivity will be made 
clear in later papers. 

2. Symbolic sequences. Having seen the significance of symbolic se- 
quences we shall now develop their theory on an independent basis. We start 
with a finite set of » distinct symbols termed generating symbols. We suppose 
that » > 1. If a» represents a generating symbol, sequences of the form 


(2. 1) 
(2. 2’) Br Great 


will be termed indexed sequences (written J-sequences). More particularly 
[-sequences of the forms (2.1), (2.2) and (2.3) will be termed I-trajectories, 
I-rays, and I-blocks respectively. It will be convenient to term the generating 
symbol represented by a» the value of a,. The I-blocks (2.3) will be said to 
have the length n and will frequently be termed n-blocks. 

Suppose A and B are J-sequences of symbols ad and bm respectively. We 
shall regard A and B as identical and write A = 8B if and only if the ranges 
of the indices n and m are the same and a, and by, have the same value. Let 
rbe an integer, positive, negative or zero. Suppose that the ranges of n and m 
are such that symbols a, and b,, correspond in a one-to-one manner with dy 
corresponding to by... If then ad» and b,,, have the same value we shall say 
that A and B are similar and write A~ B. We shall also write 

B=D,A, A= D_,B. 
lfr—0, D,.A =A, but in general D,A is different from A when rr 0. 

The class of J-sequences similar respectively to an I-trajectory, J-ray, or 
I-block will be termed a trajectory, ray, or block, and will be represented by 
the sequence of values involved without indices. For example if 1 and 2 are 
generating symbols, the unending sequence 


is a trajectory but not an J-trajectory. 
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Corresponding to an /-sequence A we shall refer to J-subblocks or I-subrays, 
We shall thereby mean subsets of consecutive symbols of A ‘ indexed as’ in 4 
with successive indices differing by unity. Thus if A is of the form 


Ay As As, 


a2d; is an [-subblock, but a,a, is not an J-subblock. 

In classical dynamics an element based on a mction 7 is a point P of 7 
together with the direction tangent to 7’ at P. The symbolic analogue of such 
an element is defined as follows. Let (a) be an I-trajectory and a, the r-th 
symbol of (a). The I-trajectory (a) taken with a, will be termed an J-element 
based on (a) at a,, and will be denoted by H(r,a). We term r the indez of 
E(r,a) and a, the preferred symbol. Let H(s,6) be an J-element of index s 
based on an J-trajectory (b). Weshall regard H#(r,a) and EH(s, b) as identical 
and write 

E(r, a) = E(s, b) 


if and only if (a) =(b) and r=s. We shall say that H(r,a) is similar to 
E(s,b) and write L(r,a) ~ E(s,6) if 


D_,(a) = D_s(b). 


The class of J-elements similar to a given J-element H(1r, a) will be termed a 
symbolic element E. Two symbolic elements are regarded as identical if the 
corresponding classes of J-elements are identical. We say that F is based on 
(a) and represented by E(r, a). 

We shall give an illustrative example. Let (a) be an J-trajectory in which 
a equals 1 or 2 according as n is even or odd, and let (6) be an J-trajectory 


in which dn. Evidently 
E(0,a) E(0, db), E(0,a) ~ #£(1, 6). 
More generally if m is even and m is odd, 
E(n, a) ~ E(m, b) 


and these similar elements serve to define a symbolic element H. There are 
just two different symbolic elements based on (a) or (6). 


3. The space of symbolic elements. We shall assign a distance Fh, 


to any two symbolic elements £,, #,. To that end we say that an I-block of 


the form 


Crom Orem 


ig 
a 
if 
( 
Ir 
E 
to 
(3 
(3 
(3 
Re 
(3 
In 
tri 
Tec 
Ev 
C0 
iro 
of 
thi 
Suy 
can 
def; 
(3, 
as 
eXis 


SYMBOLIC DYNAMICS. 819 


is centered at c,. Let H, and EH, be represented by indexed elements E(r, a) 
and L(s,6) respectively. If #,AH2, let m be the length of the longest 
I-blocks of (a) and (b) which are similar and centered ata, and bs respectively. 
We then set 


(3. 1) E,E, E(r,a)E(s,b) =1/m (m0). 


In the special case where m = 0 we regard the distance (3.1) as infinite. If 
H,= H., the distance (3.1) is taken as zero. It is clear that the distance 
is independent of the particular J-elements H(r,a) and E(s,b) chosen 
to represent H, and /, respectively. 

The distances so defined satisfy the usual metric axioms: 


(3. 2) H,H,=0 if and only if #,= Ff, 
(3. 3) ELLE, = 
(3. 4) S + 


Relation (3.4) follows from the stronger relation 
(3. 5) S max (LL, 


In case any one of the distances involved in (3.5) is 0 or «, (3.5) holds 
trivially as is readily seen. Excepting these cases, let h, k, and m be the 
reciprocals of H,H#;, H,H, and EH,H; respectively. Consider the case k = m. 
Evidently then h = m so that h = min(k,m). The latter relation yields the 
condition 
1/h S max(1/k, 1/m) 

from which (3.5) follows. The case m = k leads to the same result. 

Recall that a metric space M is compact if each infinite sequence of points 


of M contains at least one subsequence which converges to a point of M. With 
this understood we shall prove the following. 


THEOREM 3.1. The space M of all symbolic clements 1s compact. 


Let F, be the n-th element in an arbitrary sequence of symbolic elements. 
Suppose H, is based on the J-trajectory (a"). Without loss of generality we 
can suppose that the preferred symbol in (a") has the index 0. We shall 
define a doubly infinite matrix of J-trajectories 


(3.6) Ty 


a8 follows. Since there is at most a finite number of generating symbols there 
exists an infinite subset of the symbols 


a,” (n = 1, 
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which have the same “ value” ¢ . Let the J-trajectories 7',; be a subsequence 
of the J-trajectories (a”) in which do” has the value cy. Proceeding inductively 
we suppose that the first / — 1 rows of the matrix (3.6) have been determined, 
The k-th row shall then be a subsequence T;.; of the (& — 1)-st row such that 
the symbols of 7; of index k — 1 have a common value cx_1, while the symbols 
of index 1— sk have a common value ¢,-;. 

Let (c) be the I-trajectory 


Then (c) has an I-block of length 2k — 1, with center at co, in common with 
Tx. Let & be the element of index 0 based on Tm, and let € be the element 
of index 0 based on (c). Then 
so that the elements €; converge to €. Moreover the elements &; are a sub- 
sequence of the given elements €,, and the proof of the theorem is complete. 

A metric space is termed totally disconnected if its connected subsets all 
reduce to points. Cf. Hausdorff [1], p. 152. Recall also that if C is a con- 
nected set and d is a distance assumed by a pair of points of C, there exists 
a pair of points of C which assumes an arbitrary distance between 0 and d. 
In the case of the space M of symbolic elements, the distances take on only 
the discrete values 1/n, so that the space M is totally disconnected. 

Finally, the space M is perfect. We already know that M is compact. To 
prove that M is perfect it is necessary to show that in every neighborhood of 
an indexed element F(0,a) there is an J-element H(0,0) not similar to 
H(0,a). To that end we take (b) so that it has a prescribed block A of (a) 
centered at a), in common with (a), but that all longer blocks of (a) with 
centers at a differ from corresponding blocks of (b). The element /(0,)) 
will not be similar to #(0,a) but will have an arbitrarily small distance from 
F(0,a) if the block A is sufficiently long. We accordingly have the fol- 


lowing theorem. 


THEOREM 3.2. The space of all symbolic elements is compact, perfect 


and totally disconnected. 


By virtue of a well known theorem in point set theory (cf. Hausdorff [1], 
p. 160), Theorem 3.2 has the following corollary. 


CorotLary. The space of all symbolic elements is the homeomorph of the 


Cantor, perfect, nowhere dense, linear set. 
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4, The space of rays and the space of I-trajectories. Let R, and R, be 
rays with indexed representations of the form 


Ar 


li Ry A Re, the rays R, and Rk, have common initial blocks of finite maximum 
length m. With Baire (cf. Hausdorff, p. 101) we then set 


1 
Ry Rk. = 
lf R, =Rf., we set k,R,—0. The distance so defined satisfies the usual 
metric axioms including a stronger triangle relation similar to (3.5). One 
readily sees that the space of rays is compact, perfect and totally disconnected. 
This distance function in the space of rays is similar to that previously 
defined in the space of elements. We define a distance between J-trajectories 
which is based on an entirely different concept and is analogous to the Besico- 
vitch metric of the theory of almost periodic functions (cf. Besicovitch [1], 
p. 73). 
Let (a) and (b) be J-trajectories. Let | a; — 0; | be 0 if a; and b; have 
the same symbolic value, and 1 if this is not the case. We then set 


[a, = lim. sup. | ag — |. 

It is easily shown that this distance function satisfies the usual metric axioms 
with the exception that [a,b] 0 does not imply (a) = (6b). A simple 
example proving this last statement is obtained by letting all the symbols of 
(a) be 1, and all those of (b) be 1 with the exception of the symbol of index 0 
which shall be 2. 

If (a’) ~(a) and (b’) ~(b), it will not in general be true that 
[a’, b’] = [a,b], but corresponding to (a’) there always exists a (b’) ~ (b) 
such that the equality does hold. Recalling the notation of § 2, let D,(a) 
denote the I-trajectory (a’) in which a’j,,—=a;. Thus D,(a) ~ (a). Fur- 
thermore, if (a’) ~ (a), there exists an r such that D,(a) = (a’). We shall 


establish the following lemma. 
Lemma 4.1. Jf (a) and (b) are two I-trajectories and r is integral, 
(4.1) [D,(a), D-(b)] = [a, 6]. 


Since D,(D_-(a)) = (a), it is sufficient to prove (4. 1) under the assump- 
tion that r > 0. Denoting D,(b) by (b’) we have 
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1 
1 +n 
bi; 
Since 0 = | a; — b; | = 1, the last two terms approach 0 as n becomes infinite, 
Thus 
li b’ 4 li 


which is the stated result. 

Let T be a trajectory and let (a) be an I-trajectory representing 7’. Given 
e > 0, will be said to admit the «-translation r if [D,(a),a] From 
(4.1), this definition is independent of the J-trajectory chosen to represent 7’. 
Any trajectory admits the e-translation 0, and this will be termed the trivial 
e-translation. 

As in the theory of almost periodic functions, a set of integers pot ae 
<M is termed relatwely dense if the numbers |  —Ti-|, 
1—=0,+1,---, form a bounded set. We define the trajectory T as almost 
periodic if given e > 0, T admits a relatively dense set of e-translations. A 
periodic trajectory is evidently almost periodic. We shall presently define 
recurrent trajectories and shall use the notion of almost periodicity in the 
classification of such trajectories. Most of the known examples of recurrent 
trajectories turn out to be almost periodic, but the Morse recurrent trajectory 
of § 8 is not almost periodic, as we shall see. 


5. Restricted sequences. In the applications to dynamical theory the 
sequences admitted are not formed freely from the generating symbols, but 
are subject to certain limitations. The simplest of these restrictions may be 
called the restriction of inverses. In this case the generating symbols form a 
set of the form 


in which a;* is termed the inverse of a;. In this case a sequence is inadmissible 
if a symbol and its successor are inverses. In other cases a circular order ( 
of the generating symbols may appear and an integer r, with the restriction 
that p-blocks in the order C with p >, are to be excluded. Another type of 
restriction prohibits successive blocks of certain definite forms. 

In general problems restrictions of all of these types will appear. What 
these restrictions are in any problem depends upon the underlying group theory 
and topology. A detailed study of these restrictions will appear in a later 
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paper. In the present paper we are fortunately able to abstract the essentials 
of these conditions as follows. 

Certain blocks of the generating symbols will be distinguished and termed 
admissible. A ray or trajectory all of whose subblocks are admissible will be 
termed admissible as well as any element based on an admissible trajectory. 
The conditions of admissibility on blocks are as follows. 


(a). A block is admissible if and only if all its subblocks are admissible. 

(8). Corresponding to any admissible block A there exist at least two 
generating symbols e, dependent at most upon the last two symbols of A and 
such that Ae is admissible. A similar condition holds replacing last by first 
and Ae by eA. 

(y). Corresponding to any two admissible blocks A and B there exists 
at least one generating symbol e dependent at most upon the last two symbols 
of A and the first two symbols of B and such that AeB 1s admissible. 

(8). There exist at least two generating symbols « and B such that all 
blocks formed from « and B alone are admissible. 


These conditions are satisfied when there are at least two generating 
symbols and their inverses and the only restriction is the restriction of inverses. 
They are also satisfied by the well known Nielsen symbolism (cf. Nielsen [2], 
pp. 211-217) for surfaces of genus p> 1. Various symbolisms developed by 
the authors satisfy these conditions, including one which replaces the Nielsen 
symbolism to advantage. 

We state the following theorem. 


~ 


THEOREM 5.1. The set of admissible elements form a compact, perfect, 
totally disconnected subset Q of the space M of all elements. 


That © is compact may be seen as follows. Let H, be a sequence of ad- 
missible elements based respectively on trajectories ¢,. Let H be a limit element 
of the sequence Z,. Suppose that H is based on the trajectory ¢. Each block 
int must appear in some trajectory ¢t, and accordingly be admissible. Hence 
tand # are admissible, and 2 is compact. 

To show that © is perfect let / be an arbitrary admissible element. Repre- 
sent by an I-element H(0,a). Let A be a (2n + 1)-block of (a) centered 
at dy. By virtue of admissibility condition (8), the block A can be extended 
to form an admissible J-trajectory of the form 


where the symbols e; and e;, 1 > n, can be successively chosen in two different 
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ways. Let L’ be an element of index 0 based on (5.1). With the freedom 
of choice of the symbols e; and e_, it appears that the elements EH’ include 
elements different from H. But these elements also include elements arbitrarily 
near # since A is arbitrarily long. 

The set Q is accordingly perfect. It is totally disconnected since the space 
M is totally disconnected, and the proof of the theorem is complete. 

A theorem similar to the above is true of the set of admissible rays, taking 
the metric of rays into account. Unless otherwise stated, the only trajectories 
and rays which will be used will be those which are admissible in the sense of 
this section. 


6. Limit trajectories and minimal trajectories. Let ¢ be a trajectory 
represented by an I-trajectory (a). If there exists a positive integer r such that 


== Oy 0, +1, + 2,- - -), 


/ will be said to have the period r. A trajectory of period 7 is determined by 
each of its r-blocks. There are accordingly at most countably infinite many 
distinct periodic trajectories. 

Let = be an arbitrary set of admissible trajectories. A trajectory ¢ will 
be termed a limit trajectory of = if the elements based on trajectories of 3 
include among their limit elements at least one element based on ¢t. The set 
> can consist of a single trajectory, for example of the trajectory ¢ given by 


The trajectory - - - 222. - - is a limit trajectory of the trajectory (6.1). 

A periodic trajectory has no limit trajectory because it bears at *aost a 
finite number of elements. Let ¢ be a non-periodic trajectory, and (a) an 
[-trajectory representing ¢. Any two J-elements with indices n and r respec- 
tively based on (a), with n >vr, are dissimilar. For a similarity relation 
between these two elements would imply that (a) had the period n —r, con- 
trary to hypothesis. The elements based on ¢ with positive indices must then 
have at least one limit element, and we term such elements positive limit 
elements of t. Negative limit elements are similarly defined. 

A trajectory can have more than one distinct limit trajectory. Letting 


a" stand for a block a- - -a of length n we see that a trajectory of the form 
has the trajectories - --a@aa---and:- - BBB: - - as limit trajectories. More 


generally let the set of admissible blocks be enumerated in the form 


Ao, A;,° 
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If generating symbols é, and b» are successively chosen in accordance with 
block condition (y) the trajectory 


will be admissible. It will have every admissible trajectory as a limit trajectory. 
The elements based on the trajectory (6.3) are everywhere dense in the set © 
of admissible elements. Such a trajectory is termed transitive. 

In the remainder of this section we shall be concerned with a fundamental 
type of limit trajectories termed minimal trajectories. The dynamical counter- 
part of such trajectories was first introduced by Birkhoff [1]. Birkhoff also 
developed their general theory. In this connection the symbolic method was 
first used by Morse ([1], [2]) to establish the existence of non-periodic recur- 
rent motions. Theorems 6.1 and 6.3 of this section and Theorem 7. 2 of the 
following section are the abstract symbolic analogues of dynamical theorems of 
Birkhoff. The proofs presented here are independent of the theory of dif- 
ferential equations. 

Let s be an admissible trajectory and n an arbitrary positive integer. The 
number of different n-blocks of s is finite. It will be denoted by P,(n) and 
termed the n-th permutation number of s. If t is a limit trajectory of s, 


(6. 4) P,(n) = Pi(n). 


A trajectory s such that the equality holds in (6.4) for every n and every 
limit trajectory ¢ of s will be termed minimal. 

We understand that periodic trajectories are minimal. They satisfy the 
above definition vacuously since they have no limit trajectories. We continue 


with a proof of the following theorem. 


THEOREM 6.1. Among the positive (or negative) limit trajectories of 
anadmissible non-periodic trajectory T there is al least one admissible minimal 
trajectory. 

We shall give the proof for the case of positive limit trajectories. To that 
end we shall define certain symbols inductively as follows: 

H, =the set of positive limit trajectories of 7’. 


p, =the minimum of P;(1) for ¢ in Ho. 
H, =the subset of H, for which P:(1) = py. 


pn =the minimum of P;(n) for ¢ in Hy. 
H,, = the subset of Hy_, for which P:(”) = pn. 
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The set Ho is not empty since 7 is non-periodic. Hence H,» is not empty. 


Moreover 


We continue with a proof of the following statement. 


(a). The set H, is closed in the sense that it includes its limit trajectories, 


That H, is closed follows from the definition of limit trajectories. Pro- 


ceeding inductively we shall assume that Hyn-_, is closed and prove that H, is 
closed. In case the set H, has no limit trajectories, Hy is certainly closed, 
Suppose then that H, has a limit trajectory ¢. To show that ¢ is in Hn, recall 
that Hy», © Hn and that ¢ is accordingly a limit trajectory of Hn. Since 
Hy, is assumed to be closed ¢ is in H,_,. By virtue of the definition of py, 


(6. 5) Pi(n) = pn 


On the other hand, ¢ is a limit trajectory of trajectories of Hn», and the n-th 
permutation numbers of trajectories of Hy, equal pn. . Hence 


(6. 6) P:(n) S pn. 


Hence the equality prevails in (6.5) and (6.6), and ¢ isin Hy. Thus Ay is 
closed. 

The sets H, form a decreasing sequence of closed sets. It follows from 
the compactness of the space of admissible elements that the intersection H 
of the sets H» is closed and non-void. Let s be a trajectory of H. If s is 
periodic, s is minimal and the proof is complete. If s is not periodic, let ¢ be 
a limit trajectory of s. Since ¢ and s are both in H, for every n, ¢ and s have 
the same n-th permutation number for every n. Hence s is minimal, and the 
proof is complete. 


THEOREM 6.2. If s is a minimal trajectory and t a limit trajectory of 8, 
then t is a minimal trajectory and s a limit trajectory of t. 


Every n-block of s is in ¢ since s is minimal, so that s is a limit trajectory 
of t. To show that ¢ is minimal, let r be one of its limit trajectories. Then 
r is also a limit trajectory of s. Since s is minimal, the n-th permutation 
numbers of ¢ and 7 equal that of s, and so equal each other. Hence ¢ is minimal, 
and the proof is complete. 

The set of limit trajectories of a trajectory ¢ will be termed the derived 
set of t. If s is a non-periodic minimal trajectory, each member of its derived 
set S has s as a limit trajectory, and hence has the same derived set. That 
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this property is characteristic of non-periodic minimal trajectories is implied 
by the following theorem. 


THEOREM 6.3. A set S of trajectories which is the derived set of each 
of its members consists of non-pertodic minimal trajectories. 


Since the derived set of a periodic trajectory is empty, no trajectory of 
the set S can be periodic. 
If s is a trajectory of 8 and ¢ is a limit trajectory of s, 


(6. 7) P,(n) Pi(n). 


But ¢ is in S and thus s is a limit trajectory of ¢, so that the equality sign 
must hold in (6.7). Hence s is minimal and the proof is complete. 

The elements based on trajectories of § form a compact, perfect, totally 
disconnected set H. This is behind an earlier result of Morse that there exist 
discontinuous recurrent motions. Cf. Morse [2], p. 99. The set H is the 
homeomorph of the Cantor, perfect, nowhere dense, linear set. In particular 
H has the power 8 of the continuum. Since there is at most a countable 
infinity 8_ of elements based on each trajectory of S we can conclude that the 
trajectories of S have a power a such that Sa—w8. But 8.a =a (cf. Haus- 
dorff, pp. 30-31) so thata—=8. The set 8 thus has the power of the continuum. 

We shall refer to the following definition. A set of trajectories which is 
the derived set of each of its members or which reduces to a single periodic 
trajectory is termed a minimal set of trajectories. 


7. Recurrent trajectories. Let h be an admissible sequence, finite or 
infinite, and n a positive integer. A least integer m (if any exists) such that 
each m-block of h contains every n-block of h will be called the n-th recurrency 
index Ry(n) of h. If h is a ray or a trajectory and Ra(n) exists for each n, 
i will be termed recurrent and Ry(n) the recurrency function of h. 

We begin with the following theorem. 


THEOREM 7.1. If s is a recurrent trajectory and t is a limit trajectory 
of s, then ¢ is recurrent and s is a limit trajectory of t. Moreover 


R,(n) = Ri(n). 


Each block of ¢ appears in s since ¢ is a limit trajectory of s. Let A be 
an arbitrary n-block of s. A copy of A is found in each block of s of length 
R,(n), and hence in each block of ¢ of length R,(n). Hence R:(n) exists with 


(7.1) Ri (n) SR, (n). 
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Since each block A of s appears in ¢, s is a limit trajectory of 4. The rdles 
of s and ¢ can then be interchanged so that the equality prevails in (7.1), 
The proof of the theorem is complete. 

The following theorem is the symbolic analogue of a theorem of Birkhoff. 
Cf. Birkhoff [2], p. 199. 


THEOREM 7.2. A necessary and sufficient condition that a trajectory s 
be recurrent is that it be minimal. 


We begin by assuming that s is minimal but not recurrent, and seek a 
contradiction. If s is not recurrent, there exists a block U of s with the 
following property. There exists a sequence A, of blocks of s whose lengths 
become infinite with n and which do not contain U. Without loss of generality 
we can suppose that the length of A, is odd. Let #, then be the element based 
on s with preferred symbol at the center of A». Since the space of admissible 
elements is compact there exists a subsequence of the elements #, which con- 
verges to an admissible element H. Let t be the trajectory on which E£ is based. 
It is clear that ¢ does not contain the block U. Hence ts. It follows that 
¢ is a limit trajectory of s. Since s is minimal, ¢ and s must then contain the 
same blocks. From this contradiction we infer that s is recurrent. 

It remains to show that s is minimal if recurrent. If s is periodic, s is 
minimal. Assume then that s is not periodic and let S be the derived set of s. 
It follows from Theorem 7.1 that S is the derived set of each of its members. 
We conclude from Theorem 6.3 that s is minimal, and the proof is complete. 

It is natural to ask what limitations on recurrency indices (n) and 
permutation indices P(n) are imposed by conditions of periodicity or recur- 
rency. We give a partial answer in the following lemmas and theorems. 


Lemma 7.1. If P(n) is the number of different n-blocks in a trajectory 


t, P(n+1)=P(n). 
Let the different n-blocks in / be enumerated in the form 
(7. 2) B; [i 


and let e; be the successor of B; in some position in ¢. The (n + 1)-blocks 
Bye; are all different, and the lemma follows immediately. 

If (a) is an indexed trajectory, a block B of (a) whose first and last 
symbols are a, and a» will be denoted by [r,m]. We shall term r the first 
index of B and m the final index of B. 


THEOREM 7.3. A trajectory t for which P(m +1) = P(m) for some m 
has a period » = P(m). 
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Let (a) be an indexed representation of ¢. There are P(m) dissimilar 
m-blocks B; in (a). The elements immediately following copies of B,; in (a) 
must always be the same. Otherwise there would be at least P(m) + 1 dis- 
similar (m + 1)-blocks. Similarly the elements immediately preceding copies 
of B; must always be the same. 

Consider the blocks B=[1+1,1+m]. Since there are just P(m) 
dissimilar m-blocks, for some 1 there must exist a similarity relation of the form 


in which 0< w= P(m). We shall show that (a) has a period w. We begin 
with an inductive proof of the relations, 


(7. 4) Qisr = (r =1,2,:°°). 


That (7.4) holds for r—1,2,- --,m follows from (7.3). Assuming 
that (7.4) holds for r=1,2,- --,n =m, we shall prove that (7.4) holds 
forr—n-+1. The m-blocks with final indices 1 + and wo respec- 
tively are similar by virtue of our inductive hypothesis. It then follows from 
the principle stated in the first paragraph of this proof that (7.4) holds for 
r=n-+1. 

It follows similarly that = disr10 for r= 0. We thereby make use 
of the fact that two similar m-blocks must be preceded by symbols with equal 
values. One starts with the similarity relation (7.3) and proceeds inductively. 
We conclude that (a) has the period w, and the proof is complete. 


CoroLuaRy. For a non-periodic trajectory P(m-+ 1) > P(m) for each 
m> 0. 


There exist trajectories for which P(n+ 1) =—P(n) +1, so that the 
relation P(n +1) > P(n) of the corollary is in a sense the best possible. 
An example of a recurrent trajectory for which P(n+ 1) = P(n) +1 for 
each n > 0 will be given in a later paper. 


Lemma 7.2. If ¢ is a trajectory which involves p generating symbols and 
has no period less than w, then 


(1.5) P(n) =n+p—1 


for all values n for which P(n) <w. 

Relation (7.5) is clearly true when n= 1. From the preceding theorem 
P(m+-1) > P(m) for each m for which P(m) <. Relation (7.5) follows 
inductively, 


4 


(7. 3) [t+ m] ~[t+1+o,1+ m+ o], 
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THEOREM 7.4. If t is a non-periodic trajectory involving p generating 
symbols, 
P(n)=n+p—1. 


Lema 7.3. If t is a recurrent trajectory with no period less than o, 
then 


(7. 6) 2 P(n) + 2722n+ p—1 
for each n such that P(n) < o. 


Suppose that P(n) <w and set R(n) =m. Since ¢ is recurrent, each 
m-block B contains each of the P(n) n-blocks of t. Hence 


(7. 7) m=P(n)+n—1. 


We shall exclude the equality in (7.7). 

Suppose B has the form [r+ 1,7 -+ m] in an indexed representation (a) 
of t. Suppose m = P(n) +n—1. Then no two n-blocks of B are similar. 
In particular the initial n-block A of B does not appear twice in B. But 
R(n) =m so that A appears in the J-block [r + 2,r-+ m-+ 1], and since A 
appears but once in B, A appears with final index r + m+ 1. Since A appears 
with final index as well, = for arbitrary r. Hence ¢ has a 
period 

m—n+1—[P(n) +n—1]—n+1—P(n) <o. 


This is contrary to the hypothesis that ¢ has no period less than w. Hence 
m > P(n) +n—1, and (7.6) follows with the aid of (7.5). 
Lemma 7.3 and Theorem 7. 4 yield the following theorem. 


THEOREM 7.5. If ¢ is a non-periodic recurrent trajectory involving p 
generating symbols, 


(7. 8) R(n) 2 P(n) +072 2n+ p—1, 


where R(n) and P(n) are respectively the recurrency and permutation 1n- 
dices of t. 


It will be convenient to set P(0) —0 for each trajectory 1. 


THEOREM 7.6. If a trajectory t involves w generating symbols and has 
amimimum period ow, 


(7. 9) R(n) Sn+o—1 (n= 1,2,- °°). 
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Moreover there exists a largest integer m such that P(m) < w. For such an m, 
(7. 10) R(n) 2n+yp—1 
(7.11) R(n) = n+o—1 


Relation (7.9) is immediate. To establish the existence of a largest 
integer m such that P(m) <, observe that each integer n > 0 for which 
P(n) < satisfies the relations 


Sn+o—1, 


by virtue of (7.6) and (7.9). It follows that m exists and is at most w — yp. 
Relation (7.10) follows from (7.6). 
To establish (7.11) let n be any integer > m. Then P(n) =o and 


(7. 12) R(n) 2 P(n) + 


Relation (7.11) follows from (7.12) and (7.9). 

Let (a) be an arbitrary J-trajectory. Let n be a positive integer, and let 
m be an integer, positive, negative or zero. For n and m fixed, let H(m) be 
the minimum length of those blocks of (a) with initial index m which contain 
copies of all n-blocks of (a). If no such block with initial index m exists, let 
H(m) =-+ o. If, as m ranges over all integers, the set H(m) is bounded, 
it is clear that the n-th recurrency index R(n) of (a) exists and 


(7.13) R(n) = max. H(m). 


Let r be any integer such that H(r-+ 1) assumes the maximum of H(m). 
The R(n)-block [7 + 1,r-+ R(n)] will then be termed a minimax n-covering 
in (a) or in the trajectory ¢ represented by (a). The length of a minimax 
n-covering B in ¢ is R(n) and the final n-block in B does not otherwise appear 
in B. These two properties are characteristic provided R(n) exists. 

We are led to the following theorem. 


THEoREM 7.7%. If ¢ 1s a trajectory for which R(n+ 1) and R(n) exist, 
R(n+1) > R(n). 


To prove the theorem let B be a minimax n-covering in ¢. Let A be the 
final n-block in B and e the symbol following A. The (n-+ 1)-block Ae does 
not appear in B since A has no second appearance in B. Hence R(n + 1) 
>R(n), as stated. 

It may happen that R(n +1) —R(n) + 1 for infinitely many values 
of n as the recurrent trajectory of the following section shows. 
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8. The Morse recurrent trajectory. In this section we shall recall the 
definition of the non-periodic I-trajectory T' first defined by Morse [2], p. 95, 
and shall give the explicit values of the corresponding recurrency function, 
denoting this function by p(7). 

Recall that the symbols « and 8 can be combined in any way to form 
admissible blocks. We here take « and 8 as 1 and 2 respectively. Set 


A, = 1 = 
(8. 0) 

On+1 Anbn, Dn&n, 


understanding that the blocks anbn and bpd, are to be expanded as blocks of 
1’s and 2’s of length 2”**. The J-trajectory T shall have the representation 


(8. 1) * * 
In T we set 
(8. 2) Col1° ° CoM, 


Note that (8.2) is consistent for all n, since dm is an initial subblock of a, for 
m<n. The symbols c; are thereby defined for 1>0. To complete the 
definition of (8.1) we set 


(8. 3) C_r = Cr-1 (r > 0). 
The subray 
(8. 4) R CoC Co 


of 7 thus starts as follows: 
(8. 5) 1221 2112 2112 1221 2112 1221 1221 2112: --. 


Corresponding to an arbitrary block A, the block which consists of the 
symbols of A in inverse order will be denoted by A-. We note that 


(8. 6”) On == bn = bn”, (n even) 
(8. 6”) On = bn, bn = On", (n odd). 
Relations (8.6) are clearly true when n = 0, and their truth in general follows 


inductively from the relations 


= — Om dn", 


From (8.6) we infer that the 2"-block of (8.1) whose final index is —1 8 
= when n is even and when n is odd. Thus the 2**?-block 
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whose initial index is — 2” ig dnd, when n is even and Bnd, when n is odd. 
Each block of 7’ appears in such a 2”*!-block for n sufficiently large. Since 


= 


each block of 7’ appears in the subray R of (8.4). The recurrency indices 
of 7’ (if they exist) are thus completely determined by this subray. 

Let A be an arbitrary block of 1’s and 2’s. The block obtained from A 
by replacing 1 by 2 and 2 by 1 will be denoted by A’. An easy inductive proof 
shows that 
(8. 7) a'n = dn, == On. 


The trajectories T’ and T,. Let T’ be a trajectory with an indexed repre- 
sentation (@) in which e; = c; for i= 0, and for 1 < 0 equals 1 or 2 according 
as ¢; 18 2 or 1 respectively. The 2”-block of 7’ whose final index is —1 is 
m OF by according as nm is even or odd. The corresponding 2”-block of 7” is 
then @’n = bn or b’n = dn according as n is even or odd. We infer that each 
block of 7” has a copy in the subray R of (8.4). The recurrency indices of 7” 
(if they exist) are then completely determined by ?. Since this is also true 
of T we have proved the following lemma. 


LemMaA 8.1. The recurrency indices of T and T’, if they exist, are equal 
and are uniquely determined by the ray R in (8.4). 


We introduce a trajectory Tn, n = 0, whose generating symbols are dn 
and b,, and which is obtained from T or T” according as n is even or odd, by 
replacing 1 by ad and 2 by bn. It follows from the relations (8.0), (8.3) and 
(8.6) that an expansion of the symbols a, and bn in T, into blocks of 1’s 
and 2’s will yield 7. Unless otherwise stated JT, shall be regarded as a 
trajectory in the symbols a, and by, unexpanded as blocks of 1’s and 2’s. For 
example dnb, will be regarded as a 2-block of Tn, or, if we choose, as a 2"*?- 
block of 7’. 

We shall make use of the following properties of the ray F# of (8.4). 


(a). There are no more than two successwe 1’s or 2’s in R, 
(b). Hach 5-block of R contains the blocks 12 and 21. 
(c). Hach 5-block of R contains either the block 11 or else the block 22. 


To establish (a) we note that each 3-block of R will be found in one of 
the expanded 2-blocks of 7’; that is, in one of the blocks 


ab, — 1221, 1212, 
b,a, = 2112, bib, = 2121, 


and no more than two successive 1’s or 2’s appear in these blocks. 


MARSTON MORSE AND GUSTAV A. HEDLUND. 


To verify (b) observe that a 5-block B which did not contain 12 would 
contain 21 in its first 3-block by virtue of (a). Its final 3-block would then 
be 111 contrary to (a). We infer that B contains 12, and similarly 21. 

To prove (c) we infer from (a) that there can be no more than two con- 
secutive a,’s or b,’s in T,. In particular R cannot contain the block 12121 
since R would then contain the block a,a,a, or 6,b,b, according as 12121 
possessed an even or odd first index. Similarly R contains no block 21212, 
and (c) follows directly. 

Statements (a), (b) and (c) are true if R is replaced by T or T” since 
each block in 7 or T”’ appears in the subray R. 

We continue with the following lemma. 


LemMa 8. 2. The recurrency index p(n) of the trajectory T has the values 


p(1)—8,  p(2)—9, — 11. 


That p(1) = 3 follows from (a). We continue by showing that p(2) S 9. 
To that end consider a 5-block B of 7';. The block B will contain the blocks 


(8. 8) a,b, 1221, bia, 2112, 
by virtue of (b) and either 
(8. 9) 1212 or 2121, 


by virtue of (c). The only 2-blocks in T are 12, 21, 11 and 22. If B, denotes 
the block B expanded in terms of 1’s and 2’s, we see that each 2-block of 7 
appears in By interior to By. Thus the end symbols of By can be deleted and 
the resulting block B’, contains all 2-blocks of T. But any 9-block of 7 will 
contain some block of the nature of B’, so that p(2) = 9. 

That p(2) is not less than 9 follows upon considering the 9-block of 7 
with initial index 10. This block has the form 


12 1221 211, 


and will contain no block 11, if the final 1 is removed. Thus p(2) = 9. 
To show that p(3) = 11, recall that there are no more than two successive 
1’s or 2’s in T so that the only 3-blocks are 


(8. 10) 112, 122, 121, 211, 221, 212. 


Consider the block B, of the preceding proof. It contains the blocks (8.8) 
and at least one of the blocks (8.9). It follows that B, contains each 3-block 
(8.10). But any 11-block of T will contain a By, so that p(3) = 11. 
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That p(3) = 11 follows upon noting that the 11-block of 7’ with initial 
index 3 has the form 
1 2112 2112 12, 


and that removal of the final symbol from this block leaves no subblock of the 
form 212. Hence p(3) = 11, and the proof of the lemma is complete. 

Let T’; be the trajectory obtained from 7” by grouping the symbols of 7” 
into 2-blocks with even first indices, regarding 7’; as generated by the blocks 
a,,b;. It is readily seen that 7’, is formed from the blocks a, 6; exactly as T 
is formed from the symbols 1 and 2 respectively. We shall make use of this 
fact in proving the following lemma. 


LemMMA 8.3. No two r-blocks of T(T’) for which r = 4 are equal unless 
their first indices are equal mod 2. 


It is sufficient to prove the lemma for the case 7 = 4. To that end observe 
that each 4-block of T7(T’) as it stands is in some 6-block A of T(T’) with 
even first index. Let A, be the 3-block of 7,(7’,) determined by A. Since 
p(3) = 11, each 3-block of T,(T’,) appears in the 11-block of 7,(7’,) with 
first index 0. The lemma follows upon verifying its truth for the 22-block 
of T with first index 0. For each 4-block of 7 appears in this 22-block with a 
first index which is unchanged mod 2. 

The following lemma is useful. 


Lemma 8.4. If p(m-+1) exists and m= 2, 
p(2m) S 2p(m + 1) —1. 


We need only to show that each block B of T of length 2p(m-+1)—1 
contains a copy of each 2m-block x of 7’. We distinguish two cases. 


Case 1. The first index of x is even. Here x determines an m-block of 
I, and so appears in each expanded p(m)-block of T;. Hence x appears in 
each [2p(m) + 1]-block of T. Thus p(2m) exists and 


2p(m) +1= p(2m). 
But 


p(m+1) =p(m) +1, 
in accordance with Theorem 7.7%. Hence 
+ 1) —1= %(m) +1= p(2m). 


This completes the proof in Case 1. 
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Case 2. The first index of x is odd. Let c’ and c” be respectively the 
elements of 7’ which precede and follow the above block B. If B has an odd 
first index, c’B has an even first index and a length 2p(m + 1) and so contains 
a copy of each (m + 1)-block of 7. Hence c’B must contain a copy y of z. 
But the index of y is odd in accordance with Lemma 8. 3, so that y does not 
include c’. Hence z appears in B. If B has an even first index, Bc” contains 
a copy y of x. But the index of c” is odd, while the final index of z is even. 
Hence y does not include c” and lies in B. 

The proof of the lemma is complete. 

Since p(3) exists it follows from the lemma that p(4) exists, then p(6), 
p(10), etc. But if p(m) exists, p(m) exists for m <n. We conclude that 
p(n) exists for all n. 

We come to a principal theorem. 


THEOREM 8.1. For m= 2 the recurrency function p(m) of T' satisfies 
the relations 


(8. 11) p(2m) = 2p(m + 1) —1, 
(8. 12) p(2m +1) = 2p(m +1). 


Proof of (8.11). It will be sufficient to show that the equality prevails 
in Lemma 8.4. To that end let g be a minimax (m + 1)-covering in 7’, as 
defined in §7. The length of g is p(m-+1), and the final (m + 1)-block 
of g appears only once in g. Let x be an arbitrary 2m-block of 7 of odd first 
index, and let y be an (m + 1)-block of 7, which when expanded contains 2. 
When m = 2, x appears only as an interior block of the expanded y in ac- 
cordance with Lemma 8.3. Since a 2-block of T of even index is uniquely 
determined by either of its symbols we see that y is uniquely determined by z. 
It thus requires copies of all (m + 1)-blocks of T, to cover all 2m-blocks of 7 
of odd index. In particular it will require a copy of the final (m + 1)-block 
of g. As a block of 7’, g can then be shortened on the right by at most one 
symbol and, when expanded, still cover all 2m-blocks of 7. The equality 
accordingly holds in Lemma 8.4, and the proof of (8.11) is complete. 


Proof of (8.12). Recall that p(2m + 1) exceeds p(2m). It follows then 
from (8.11) that 
(8. 13) p(2m +1) = 2p(m+1). 


It remains to show that the equality prevails in (8.13). To that end let J 
be a 2p(m + 1)-block of 7, and let c’ and c” be respectively the symbols pre- 
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ceding and following B. Let z be an arbitrary (2m-+1)-block of T. We 
distinguish between several cases: 


Case 1. The index of c’ is even, that of x odd. 
Case 2. The index of c’ is even, that of x even. 
Case 3. The index of c’ is odd. 


Case 1. The block w appears in some expanded (m-- 1)-block of 7). 
Hence x has a copy y in c’B, for the first 2p(m + 1)-block of c’B determines 
ap(m-+1)-block of 7;. But the index of y is odd so that y does not include 
c’ and thus lies in B. 

Case 2. Here c” has an odd index and the last 2p(m + 1)-block of Bc” 
determines a p(m-+ 1)-block of 7,. Hence x has a copy y in Be’. But the 
final index of y is even so that y les in B. 

Case 3. In this case B determines a p(m + 1)-block of 7’;, hence con- 
tains a copy of each (m-- 1)-block of 7, and accordingly a copy of z. 


Thus a copy of x appears in B in all cases. The equality accordingly 
prevails in (8.13), and the proof of (8.12) is complete. 

With the aid of the preceding theorem we can derive a general formula 
for p(n) when n= 3. To that end we first show that integers n = 3 can be 


represented in the form 
(8.14) N= 2" + p, (p = 2° +1), 


where r= 0 and vr and p are uniquely determined by n. The smallest and 
largest values of » given by (8.14) for a fixed r are respectively 


ar 42 


and when r is replaced by r + 1 the smallest value of n is 


Qrtt 2, 
Thus as r ranges over the values 0,1,2,- - - and p over the corresponding 
values listed in (8.14), n takes on each of the integers 3,4,- - - once and 


only once. We continue with the proof of the following theorem. 


THEOREM 8.2. For values of n of the form (8.14) the recurrency func- 
tion p(n) of T has the value 


(8. 15) p(n) =10-2° + p—1. 


When + = 0 in (8.14) p=2 and n=3. We have seen that p(3) = 11, 
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so that (8.15) holds when n= 3. When r=1, we see that p—2 and 3 
while »—4 and 5 respectively in (8.14). Relation (8.15) affirms that 
p(4) = 21 and p(5) = 22, and these values are correct as follows from (8. 11) 
and (8.12) respectively upon setting m = 2. We proceed inductively assuming 
that (8.15) holds for a fixed r and the related values of p, and seek to prove 
that 


(8. 16) p(u) = 10-2144. 
where ; 
(8. 17) pe (q==2,- °°, 


We distinguish between the cases where q is even and q is odd. 


Case 1. q even. Here q has the form 2s and 


p= 2(2"+ 8). 
Upon setting m = 2"-+-s in (8.11) we find that 
(8. 18) = 2p(2" +841) —1. 
Upon referring to (8.17) we see that s+ 1 ranges over the values 
(8. 19) 2°41. 


When n = 2°-+8s+1, p=s-+1 in (8.14) and the range (8.19) is also the 
range prescribed in (8.14). We can accordingly use our inductive hypothesis 
to infer that 

(8. 20) + =10-2° + p—1—10- 2+. 


Upon returning to (8.18) we conclude that 
p(w) = 2(10- 2" + s) —1—10-2 4 g—1. 
Relation (8.16) thus holds when q is even. 
Case 2. q odd. Let q and yp be values of g and p given by (8.17) with 


gq odd. Corresponding to q and yp there exists a pair of even integers q,, 4 
again satisfying (8.17) with p, = 2m and 
q=ut+l, p= 2m +1. 
Upon using the result established under Case 1, 
= p(2m) = 10-27" + gq, —1. 
But 
p(2m + 1) =p(2m) +1 


by virtue of (8.11) and (8.12), so that in Case 2 
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p(#) = 10-2" +4 g, = 10-274 4 g —1. 


Thus (8.16) holds in Case 2, and the proof of the theorem is complete. 

This explicit determination of p(n) is the first such determination for a 
non-periodic, recurrent trajectory. 

The manner in which p(n) varies with n is illustrated by the following 
table : 

m==1,2, 8, 4, 5, 6, 7%, 8, 9, 10, 11, 12, 18, 14, 15, 16, 17, 18, 

p(n) = 3, 9, 11, 21, 22, 41, 42, 43, 44, 81, 82, 83, 84, 85, 86, 87, 88, 161, 
where the groups underlined are related in an obvious manner. For a given r 
in (8.14) the smallest and largest values of p(n) are respectively 


10-27+ 1, 
We see that for a fixed r, 


19 > 10-2" +15 _ 10-2 + 1-2 


We are thus led to the following theorem. 


IV 


THEOREM 8.3. If p(n) 1s the recurrency function of T, 


= lim. sup. ets) = 10 


2 


(8. 21) lim. inf. (2) = 


and 10 and 11/2 are the limits respectiely of p(n) /n for sequences of values 
of n given by 
= + 2, m= 0, + +). 


If e is a prescribed positive constant and r is a sufficiently large positive 


integer, the ratios 
p(n) 
n 2° p 
assume values within e of each value k& between the limits (8.21). There 
accordingly exists a subsequence of values of n for which 


(p = 2, 3,° +1) 


lim. 


n 
For all values of n, p(n) < 10n, while for each positive e and sufficiently large 
Values of n, 2p(n) > (11—e)n. 
That the trajectory 7’ is not periodic follows from Theorem 7. 6. 
In § 4 a trajectory has been defined as almost periodic if it admits a 
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relatively dense set of e-translations for any positive «. We show that a recur- 
rent trajectory is not necessarily almost periodic by proving that the Morse 
recurrent trajectory admits only the trivial e-translation if «< 1/4. To that 
end we first prove two lemmas. 

Lemma 8.5. If: 1s a sequence of the numbers 0 or 1, and 
p and x are integers, x > 0, then 

n+p Kit 

li = lim. 5; = lim. Bt. 


The first silts follows at once from the vies 


n | 
Sli 


As to the second, let n=«m+r,0=r<k«. Then 


1 n Am Km+r 
s—— + 8; 8; 


where the last two sums are not present if r 0. Each of these last two sums 
approaches 0 as n becomes infinite. It follows that 


lim. su 8; = lim. su 8; 
2n 1 an ] 2 mu 
1 1 Am 1 Ai 
= . sup. = lj 


The proof of the lemma is complete. 

We recall that 7, is the trajectory with generating symbols a, and b, 
obtained from T' or 7’, according as n is even or odd, by replacing 1 by a, and 
2 by b, and that an expansion of the symbols a, and b, in T,, into blocks of 
1’s and 2’s will yield 7. 


Lemma 8.6. If r= 2?q, p= 0, and (c) and (d) are I-trajectories repre- 
senting T and T>, respectwely, 


(¢) ] = [Pa(4), (4) 


It follows from Lemma 4. 1 that the value of [D,(c), (c)] is independent 
of the particular indexed representation (c) of 7’ which we use. We shall find 
it convenient to take (c) as the J-trajectory (8.1). Then D,(c) is an /- 
trajectory (c’) in which c’; =c;_,. Similarly we can suppose that in (4), 
d; in expanded form is given by 
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for (d) is then an indexed representation of Ty. Since r= 2?q, Dg(d) has 
the form (d’) where = = Ci2”-r* * * Ccis1)2?-r-1, in expanded form. 

Since d; and d’; are either a, or bp, it follows from (8.7%) that in their 
expanded form, d; and d’; are either identical, or one is obtained from the 
other by replacing 1 by 2 and 2 by 1. If we set 


(4+1 2?-1 


it follows that 
{d’; — di} = 2? | — di; |. 


With the aid of Lemma 8. 5, we then see that 


m2? 


= lim. SUP. | 
4=-m2” 
= lim. su — | 
= lim.s d’,—d 
= lim. sup. | | 


1 
m->0O m +1, —m 


lim. sup. = | d’; —d, | = [D,(4), (d)]. 


The proof of the lemma is complete. 
We come to the proof of the desired theorem. 


THEOREM 8.4. The Morse recurrent trajectory admits only the trivial 
translation 0 if « << 1/4. 


It is sufficient to show that 
(8. 22) [Dr(c), (¢)] = 4, 0, 


where (c) is again the indexed representation (8.1) of the Morse recurrent 
trajectory. 


Case 1. r odd. Let 


n+3 n+3 
=> | | = > | Ci-r — C4 I. 
i=n i=n 


Since r ig odd, the first indices Of Cn-r€n-rs1€n-rs2Cn-r+g3 ANA Cn€ny1Cns2Cnsg ae not 
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congruent mod 2. It follows from Lemma 8. 3 that these two 4-blocks cannot 
be similar, so that o, = 1. Applying Lemma 8. 5, we find that 


[Dr(c), (c)] = lim. sup. — | 
= lim.s 
= lim. sup. 


8n+1 1; 4=-n 


Since on = 1, it follows that } ou, = 2n+- 1, and hence 


i=-n 


(8.28) [D+(c), (¢)] = lim. sup. ou lim. 4, 


We have proved (8. 22) when r is odd. 

Let (€) be the I-trajectory obtained from (c) by replacing Cn, n < 0, by 1 
if ¢, is 2, and by 2 if c, is 1. Then (é) is an indexed representation of the 
trajectory T’. Since Lemma 8. 3 holds for T’, as well as 7’, a proof similar to 
that given for (c) yields the relation 


(8. 24) [D-(é), (¢€)] = 4, r odd. 
Case 2. reven. Since r~ 0, r = 2?q, where g is odd. From Lemma 8. 6, 
(8. 25) [D;(c), (¢)] = [Da(4), (4)]. 


The symbols of (d) are a» and by, and if in (d) we set a> —1, bp = 2, we 
obtain (c) or (é) according as p is even or odd. Since q is odd, (8. 23) or 
(8. 24) applied to the right side of (8.25) yields 


[Da(4), (d)] =F. 
Combining this inequality with (8.25) we conclude that 
[D-(c), (c)] = 4, r even. 


The proof of the theorem is complete. 
The following corollary is an immediate consequence of the preceding 
theorem. 


Corottary. The Morse recurrent trajectory is not almost periodic. 


Upon distinguishing recurrent trajectories according to whether they are 
almost periodic or not, we can state that recurrent trajectories of the type of 
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Morse are the only ones so far defined which are not almost periodic. In 
particular, the recurrent trajectories defined by Birkhoff ([4], p. 22) are 
almost periodic. The proof of this and the further development of the theory 
of almost periodic trajectories will be given in a later paper by the authors. 


9. The derivation of recurrent trajectories. Having exhibited a non- 
periodic recurrent trajectory 7’, we shall describe four methods of deriving a 
recurrent trajectory from 7’, termed respectively derivation by projection, re- 
duction, association and substitution. We also introduce a general method of 
derivation which includes each of the preceding. If the given trajectory con- 
sists exclusively of the free symbols « and £, these derived trajectories will all 
be admissible. In general they may or may not be admissible. If the given 
trajectory 7 is recurrent, the derived trajectories will be recurrent although 
usually differing in high degree from 7. We suppose that 7’ has an indexed 
representation (c) in terms of generating symbols ,- ° -, dp. 


Projection. The trajectory obtained from (c) by omitting a, whenever 
it occurs in (c) will be said to be the k-th projection of (c). 


Reduction. If the generating symbols consist of a finite set of integers, 
the trajectory obtained by reducing the elements c; mod p will be said to be 
derived by reduction mod p. 


Association. Let m be a positive integer at least 2, and let B, be the 
m-block of (c) whose first index isi. There is at most a finite set of different 
blocks B;. We introduce the J-trajectory 


(B) 


regarding the blocks B; as the generating symbols. 


Substitution. Let A,,: -,Ap be s-blocks of the generating symbols, 
s>0. The trajectory (c*) of a;’s obtained by replacing a; in (c) by Ay and 
expanding will be said to be derived from (c) by block substitution. 


Generalized substitution. Let i'm be a fixed set of integers, posi- 
tive, negative, or zero. Let be a single-valued function (either 
numerical or symbolic) of its arguments 2; as each 2; ranges over the set 
(4,,°- *,day) of generators of (c). We include the convention that a null 
set may be one of the symbolic values of ¢. We introduce the symbols 
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and thereby define an J-trajectory (#). The number of different symbols £, 
is finite. We add the convention that a null set H; is to be dropped from the 
trajectory (#). Derivation by generalized substitution includes all of the 
preceding types of derivation. We shall illustrate this fact for the case of 
derivation by association. Derivation by association is obtained if we set 


Lm) (2, ° » Xm), 
and understand that 


The symbol F; reduces to the block 


as in the case of derivation by association. 

We shall illustrate these processes further by deriving two recurrent tra- 
jectories from the trajectory 7 of the preceding section. We shall use the 
method of association letting m = 2. The generating symbols are the 2-blocks 
of T, namely 11, 12, 22, 21. We shall denote these blocks respectively by 
1, 2, 3, 4 and write a few terms of the resulting trajectory. The symbols with 
indices 1 = 0 written above the corresponding symbols of 7 begin as follows: 


2342412341242342412 
12212112211212212112. 


Upon reducing the first of these trajectories mod 2 we obtain a trajectory 
which contains 
0100010101000100010. 


This trajectory is distinct from 7’ since it has sequences of three equal symbols. 
These new trajectories are recurrent as we shall see. 

A trajectory T’ derived from a recurrent trajectory 7 by projection or 
reduction is clearly recurrent. The trajectory 7’ may however be periodic 
when 7’ is non-periodic. The facts for the case of derivation by association 
are otherwise, as the following theorem shows. 


THEorEM 9.1. If (c’) is a recurrent I-trajectory derwed from a recur- 
rent I-trajectory (c) by association in r-blocks, and R’(n) and R(n) are the 
recurrency functions of (c’) and (c) respectively, then 
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(9.1) R’(n) = R(n+r—1)—r+1. 
Moreover (c’) is periodic if and only if (c) is periodic. 

A block [& +-1,4-+ n] of (c’) is determined by the (n + 7— 1)-block 
(k+1,4-+ of (c) and conversely. We term these blocks corre- 
sponding. Hach (n+ 1*—1)-block of (c) appears in each R(n + r—1)- 
block H of (c). But H corresponds in the above sense to a block (c’) of length 


(9. 2) R(n+r—1)— (r—1), 


while an (nm + 7 — 1)-block of (c) corresponds to an n-block of (c’). Hence 
R’(n) is at most the integer (9. 2). 

To show that R’(n) is at least the integer (9.1), let H be a minimax 
(n+ *—1)-covering in (c), that is an R(n-+17—1)-block of (c) whose 
final (n + 7—1)-block appears just once in H. Let K be the block of (c’) 
corresponding to H. The final n-block of K corresponds to the final (n +- r—1)- 
block of H and appears just once in K, so that R’(n) is at least the integer 
(9.1), and the proof of (9.1) is complete. 

If (c) is non-periodic, R(n) > 2n in accordance with Theorem 7%. 5. It 
follows then from (9.1) that 


R’(n) > 2n + 7r—1. 


The I-trajectory (c’) is accordingly non-periodic, and the proof of the theorem 
is complete. 
The preceding theorem leads to the following extension of Theorem 7. 5. 


THEOREM 9.2. If P(r) is the number of different r-blocks in a recurrent 
non-pertodic I-trajectory (c), then form =r, 


(9.3) R(m) 2 2m-+ P(r) 


Let (c’) be derived from (c¢) by association in r-blocks. If R’(n) is the 
recurrency function of (c’), it follows from Theorem 7. 5 that 


R’(n) = 2n+ P(r) —1. 
But from the preceding theorem we then see that 
(n) Z2n+ P(r) — 1, 


from which (9.3) follows upon setting m = n+ r—1. 


Before coming to the next theorem we shall need the following definition. 
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A definition. Let there be given a finite set S of qg-blocks B,,- - -,B, 
of a set of generating symbols. Let nm be a positive integer less than q. A 
least integer r(n) Sq (if such exists) such that each n-block appearing in a 
block of the set S appears in each r(n)-subblock of each of the blocks B; will 
be called the n-th recurrency index of the set S. For example, let the blocks 
B; be the blocks 


(9. 4) B, 1221, B,—2112, B,—1212, B,— 2121. 


These blocks have a recurrency index r(1) = 3. The recurrency index 
does not exist. 

The following theorem concerns the /-trajectory (c*) derived from the 
[-trajectory (c) by substitution. The /-trajectory (c*) is formed as described 
above by substituting the respective s-blocks A; for the generating symbols a; 
appearing in (c). These blocks A; are then expanded to form the I-trajectory 
(c*) of the symbols a;. We shall apply the preceding definition of the recur- 
rency index of a set of blocks to the set 8 of 2s-blocks A;A;. Each m-block 
of (c*) for which ms + 1 lies in one of these 2s-blocks of (c*) so that in 
the case where the set 9 has a recurrency index r(m) = s + 1 it is clear that 
the recurrency index R*(m) of (c*) exists and is at most r(m). This is 
affirmed in part (a) of the theorem. Part (b) conditions K*(n) on the lower 
side instead of on the upper side. This theorem is useful in the following 


section. 


THEOREM 9.3. Let (c*) be an I-trajectory derwed from an I-trajectory 
(c) by substitution as described above. 

(a) If the set of 2s-blocks A,A; of (c*) has an m-th recurrency index 
r(m) Ss-+1, then 
(9. 5) R*(m) Sr(m). 


(b) If some block A; appears r times consecutively in (c*) with r >1, 
and if (c*) does not have the period s, then 


(9. 6) R*(s+1) > er, 
or else R*¥*(s-+ 1) fatls to exist. 


Statement (a) has already been proved. We therefore turn to the proof 
of (b). 

It is assumed that the block B = A;- - - Aj, of length sr, appears in (c*). 
Since the length of A; is s, any (s + 1)-block of B has equal terminal elements. 
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If R*(s-+ 1) exists and R*(s-+ 1) Ssr, every (s+ 1)-block of (c*) has a 
copy in B. Thus every (s+ 1)-block of (c*) has equal terminal elements and 
(c*) has the period s, contrary to hypothesis. 

The proof of the theorem is complete. 

The following theorem has a special interest in view of the mode of forma- 
tion of the Morse recurrent trajectory. The trajectory H thereby affirmed to 
exist is not necessarily admissible. It will be admissible if the only symbols 
employed are « and 


THEOREM 9.4. Corresponding to an arbitrary recurrent ray 


there exists a recurrent trajectory H with R as subray and with the recurrency 
indices of R. The trajectory H is in general not uniquely determined by R. 


Let B, be the initial r-block of R, r= 1,2,---. The block B, appears 
later in immediately preceded by some 1-block A,. Of the blocks Ay, - 
there exists an infinite subset S, with last symbol in common. We take a_, as 
this last symbol. Of the blocks of S, there exists an infinite subset S. with the 
second from the last symbol in common. We take a_, as this common symbol. 
Proceeding in this way we define sets S;,S4,- - - and symbols a_3, a_4,° °° 
respectively. That the resulting trajectory H is recurrent and has the same 
recurrency indices as #2 follows from the fact that each block of H appears in R. 

That H is not uniquely determined is shown by the trajectories 7’ and 7” 
of §8 which have the recurrent ray R of (8.4) as a common subray. 


10. Deferred recurrence. In this section we shall exhibit a recurrent 
trajectory H for which R(n) exceeds a prescribed function f(n) for infinitely 
many values of n but which is such that the limit inferior of R(n)/n is finite. 
That a recurrent trajectory exists for which k(n) exceeds a prescribed func- 
tion f(n) for all values of n has already been shown by Robbins ([1], Theorem 
2). Our example shows the possibility of extreme variability in the indices 
R(n). The trajectories / here defined form an extended class of which the 
trajectory T' of § 8 is a member. 

The generating symbols shall be the free symbols # and B. Let 19,71,° ° - 
be an arbitrary set of positive integers. We shall make use of powers of blocks, 
understanding that these powers are to be formally expanded. Set 


A = %, By B, 
A A 0”°Bo"®, B, A 


Ann = AntB,™, Bay = Brt*An". 
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Let s, be the length of A, and B,. Observe that 


(10. 1) Sn41 ns 


Let (c) be an I-trajectory such that the initial sn-block of the ray ¢o¢,c.° - - 


is A, for each n. This definition is self consistent since A,»_; is an initial 
block of Ay. We further set 


(10. 2) C_m-1 == Gm (m = 0, 1,- as 


Proceeding inductively we see that 


= An, = (m even) 
A om Bn, = As, (n odd). 


Hence the sn-block with final index —1 is An when n is even and B, when n 
is odd. If the symbols of (c) are grouped into s,-blocks with first indices 
= 0 mod s;, one obtains an J-trajectory H, with generating symbols A, and B,. 

If the integers 7; are all 1 and «1, B=2, (c) yields the recurrent 
trajectory T of § 8. We shall make use of the recurrency function p(n) of 7. 
The trajectory defined by (c) will be denoted by #. It is uniquely determined 
by the symbols and and the integers 


THEOREM 10.1. The trajectory E is recurrent and non-pertodic with a 
recurrency function R*(n) such that 


(10. 3) Sn41 = R* (sn ] ) 


Let B be an arbitrary (s, + 1)-block of #. The block B appears in some 
expanded 2-block of #,. But each such 2-block of #, appears in both Ans; and 
Bnsg as is easily seen. The latter two blocks are sn,s-blocks of H and one of 
them at least is in each 2sn,3-block of #. Hence the right-hand condition in 
(10.3) is valid. It follows that # is recurrent. 

The left condition in (10.3) follows from (b) in Theorem 9.3. For the 
block A» appears 27, consecutive times in #. Moreover FZ does not have the 
period s, since # contains the block A,B, and By is different from An. From 
(9.6) we then find that 

R* (sp) +1) = 2rnSn = Saar; 


and the proof of (10.3) is complete. Moreover we see that 
R* (sn +1) = 25n 


and it follows from Theorem 7. 6 that # cannot be periodic. 
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Jf ts an arbitrary positive integral function, it is 
possible to choose the r,’s or any subset of the ry’s so that for corresponding 
values of n, 


(10. 4) R*(s, +1) > f(s» +1). 


Recall that sy.; = 28,1'n. In order that (10.4) may hold we have merely 
to take 7, so large that 


and (10.4) will follow from (10.3). 
We continue with the following theorem. 


THEOREM 10.2. If the sequence of integers r; used in defining E con- 
tains arbitrarily long blocks of 1’s, 


R¥(n) — 11 


0.5 lim. inf. 
(10. 5) im. in 9 


If p(m) is the recurrency function of the trajectory T of § 8, there exists 


a sequence of integers m; becoming infinite with 7 and such that 


p(m) 11 


(10. 6) — 5 


We suppose integers m; so chosen. The sequence of integers 7; contains arbi- 
trarily long blocks of 1’s. It follows that there exists a sequence of positive 


integers 

such that for each 7, 

(10.8) The = Tha 1. 


Moreover these integers can be chosen so that 
ky — hy > p(m). 


We shall prove the following statement. 
(a). The m,-th recurrency index of Ey, is at most the recurrency indes 


p(m;). 


The trajectory 4, is generated from the blocks An,, Bi, These blocks 
combine to form the blocks Ax,, Bz, in exactly the way in which the symbols 
land 2 combine to form the blocks ay, and by, of the Morse trajectory T. In 
Fy, the blocks 


| 
Sn41 
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(10. 9) Ax, Bx, Ay, Ax, By Ary Br, 


regarded as blocks of the symbols Aj, and Bn,, accordingly have an mj,-th re- 


currency index p(m;). For 
p(mi) < ky — he < Qui, 


and 2“¢ is the length of the blocks A,, and By, regarded as blocks of the symbols 
An,, Bn,. It follows from (a) of Theorem 9. 3 that Hy, has an m,-th recurrency 
index at most p(m,), and the proof of («) is complete. 

We now regard £ as derived from Ex, by expansion of the blocks An, and 
Bn, of Hy, into blocks of @’s and f’s. Set 


(10. 10) Ni = Sn,(m,—1). 


An arbitrary n-block of # is found in some expanded m,-block of Hn,, and 
hence in each expanded p(m;)-block of and finally in each + sa, 
block of #. Hence 

(10. 11) R(m) S +1]. 


It follows from (10.10), (10.11) and (10.6) that 


lim. inf. i 


i-00 4 


and the proof of the theorem is complete. 
The preceding theorems and corollary combine to give use the following. 


THEOREM 10.3. If f(n) is an arbitrary positive integral function of 1, 
then for suitable choices of the integers r; defining E, the recurrency function 
R*(n) of E exceeds f(n) for infinitely many values of n while 

R*(n) — 11 


lim. inf. 
n->00 n 2 


IIA 


11. Transitive rays and their ergodic functions. We shall concern 
ourselves with rays of the form 
Col1C2° 


Such a ray will be termed transitive if it contains a copy of each admissible 
subblock. Transitive rays exist. If A,, A2,--- is an enumeration of admissible 
blocks, the ray 
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is admissible provided the symbols e; are successively chosen in a suitable way. 
The ray X is clearly transitive. Let R be an arbitrary ray and let n be a posi- 
tive integer. Suppose an initial r-block B of R contains each admissible n-block 
while no initial subblock of B has this property. The integer r will then be 
termed the n-th transitivity index o(n) of R. If R is transitive, the n-th 
transitivity index of F# exists for each value of n and will be termed the ergodic 
function p(n) of R. 

A block H(n) of minimum length containing each admissible n-block will 
be termed a minimum n-covering. The length 0(n) of H(n) will be termed 
the n-th covering index. In the case where the generating symbols are all free, 
Martin [1] has shown that for each n there exists a minimum n-covering in 
which no n-block is repeated. We shall presently extend Martin’s result to the 
case where successive inverses are prohibited. We are concerned here with the 
existence of transitive rays whose ergodic function is in some sense relatively 
small. Noting that each ergodic function (nm) satisfies the condition 


¢(n) = A(n), 


we are led to the question, does there exist a transitive ray such that 
$(n) = 6(n) for each n? The answer is no, in general, as we shall see. 

We begin with the case where there are just two generating symbols, 
namely the free symbols « and 8. In this case it follows from Martin’s theorem 
that 

O(n) = 2"+ n—1. 


The only minimum 1-coverings are #8 and Ba. The only minimum 2-covering 
which starts with «8 is aBBaa. If we wish a minimum 3-covering which starts 
with this 5-block we may continue with 8 or a, but we must then continue with 
a8 or BaB respectively, thereby obtaining the blocks 


(apBaa)Bap, 


Further continuation subject to the condition that no 3-block repeats is im- 
possible. Since 0(3) — 10, there exists no transitive ray for which 


p(n) = (n = 1, 2, 3). 


It is conceivable that there might exist a transitive ray for which 
¢(n) = 6(n) for all values of n exceeding some fixed integer m. That this 
is impossible is shown by the following previously unpublished theorem of R. 
Oldenburger. 
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THEOREM 11.1. If the generating symbols reduce to two free symbols 
a and B there exists no transitwe ray whose ergodic function ¢(n) equals the 
covering index 0(n) for two successive values of n > 1. 


The proof as given by Oldenburger is essentially as follows. 

We assume the theorem false. There then exists a minimum n-covering 
H{n), n> 2, which starts with a minimum (n—1)-covering H(n—1), 
No n-block repeats in H(n), nor (n—1)-block in H(n—1). We shall 
arrive at a contradiction. 

Since « and f can be interchanged throughout it is no essential restriction 
to assume that in H(m—1) the (n—1)-block 8" occurs before «"*. We 
shall then show that H(m—1) has the form 


(11. 1) H(n—1) = a- - - Ban*], 
and that H(n) has the form 
(11. 2) H(n) = H(n—1)[aB- - - 


The block of H(n) which remains when H(n — 1) is removed will be denoted 
by J. To establish the correctness of the representations (11.1) and (11.2) 
one reasons as follows. 

The block «” appears in H(n). It cannot appear wholly in H(n—1), 
for this would imply repetition of «”* in H(n—1), and thus Ba” appears in 
H(n). Since precedes in H(n—1), Ba" appears in H(n—1). 
Since Ba"-! appears just once in H(n), it must appear as the initial n-block 
of Ba", and since this latter block is not wholly in H(n —1), H(n —1) ends 
with Ba" and J begins with «. This « is of course followed by £. 

The block 8” is in H(n) and not wholly in H(n—1). Since J begins 
with a, 8” must appear wholly in J, and hence #8"! and «§” appear in J. 
It follows that 8"-!, which appears in H(n—1), must appear at the left of 
H(n—1) ; otherwise it would be preceded by a, and #8"! would appear twice 
in H(n). Hence H(n—1) begins with 8". If J does not end with pf’, 
Ba appears in J, and B"-« appears both in J and H(n—1). Hence J ends 
with a” and the representations (11.1) and (11.2) are correct. 

We now make use of the assumption that n > 2. Then n— 2 > 0 and 
a8" occurs in H(n—1). This block cannot be followed by £ without 
repetition of B"* in H(n—1). Hence a@"-*a appears in H(n—1). But 
B"-a already appears at the beginning of H(n—1). Hence H(m) cannot 
start with a minimum (n—1)-covering, and the proof of the theorem Is 


complete. 
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We turn to the case where the only condition on the generating symbols 
is the restriction of inverses. This is the simplest case of strict dynamical 
significance. It arises in the case of surfaces of negative curvature with 
boundaries. Cf. Morse [1]. In this case one has a set of generating symbols 
of the form 
(11. 3) ** * (p> 1), 


with the restriction that successive symbols shall not be inverses. We shall 
prove the following theorem by suitably modifying Martin’s proof. 


THEOREM 11.2. Lor generating symbols of the form (11.3), subject 
to no condition beyond the restriction of imverses, there exists a minimum 
r-covering in which no r-block repeats. 


The following rule for the formation of a minimum r-covering H (7) was 
first given by Martin for the case of free generating symbols ¢,,: - -,¢n. Start 
with c,""* and continue successively adjoining the generating symbol of highest 
index consistent with the condition that no r-block be repeated. This will lead 
to a minimum r-covering in the case of free symbols c;. For the case of in- 
verses we adjoin the condition that successive symbols shall not be inverses. 
With this condition added the process will not always lead to a minimum 
r-covering. We are able however to prove the following lemma. 


Lemma 11.1. A necessary and sufficient condition that the application 
of the Martin rule, subject to the restriction of inverses, shall lead to a mint- 
mum r-covering is that c, and C» shall not be inverses in the set ¢,,° - +, Cn of 
generating symbols. 


Let M, be the block of maximum length obtained by applying the Martin 
tule subject to the restriction of inverses. If A is any admissible (r—1)- 
block, the r-blocks 
(11. 4) Atn,* Ae; 


appear in M,, if at all, in the order written. One of the blocks (11.4) is 
inadmissible since the last symbol in A cannot be followed by its inverse. If 
Ac, is in M,, all other admissible blocks in (11.4) are in M,. If Ac, is not 
admissible, Ac. is admissible and is in M, only if all of its predecessors in 
(11.4) are in M,. We set = We continue with a proof of (a) and (b). 


(a). The block M, ends with FE. If M, ended with an (7 —1)-block 
ASF, M, could not begin with A, since M, begins with #. A would then 
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M, with c, (or c, if Ac, is inadmissible), contrary to the hypothesis that WV, 
ends with A. Hence M, ends with Z. 


(b). The block M, ends with Ec,. Suppose M, ends with an r-block 
BALc,. Set B= bE in accordance with (a). We are supposing that b + ¢,. 
The block # appears n times in M,; otherwise M, could be continued. Hence 
Ec, appears in M, but not at the end. But He, cannot be continued in ac- 
cordance with the Martin rule. From this contradiction we infer the truth 
of (b). 

We shall now make use of the assumption that c, and c. are not inverses 
and show that M, contains all r-blocks. 

We first observe that each admissible block Hc; appears in M, since M, 
ends with Hc,. Hence M, contains n copies of FE. It follows that M, contains 
each admissible block cjZ. Let B=b,---b, be an arbitrary admissible 
r-block. We shall show that M, contains B. 

Suppose that M, does not contain B. Set b.---b,—D. Then DAH 
by virtue of (b). Hence D appears at most n— 2 times in M,; otherwise 
each admissible block c;D would appear in M,, including B. Hence Dc, does 
not appear in M, (or Dez, if Dc, is inadmissible). We now apply the same 
reasoning to Dc, (or Dez) that we have applied to B and infer that b;- - - b-c,¢; 
(or b,: - - byc2c,) does not appear in M,. Continuing we arrive at the con- 
clusion that c," does not appear in M,, contrary to (b). 

The condition of the lemma is accordingly sufficient. 

To prove the condition necessary we suppose that c,; = c.-'. If M, were 
an r-covering ¢c.” would appear in M,, but not in the last position, by virtue 
of (a). But c.” cannot be continued without violating the Martin rule, or 
else the condition on inverses. Hence the condition is necessary, and the proof 
of the lemma is complete. 

Theorem 11. 2 is an immediate consequence of the lemma. 

If there are n generating symbols subject merely to the restriction of 
inverses, there are n(n —1)** different admissible r-blocks so that the r-th 
covering index 6(”) has the value 


With this understood we shall prove the following theorem. 


THEOREM 11.3. When there are n generating symbols subject merely 10 


appear in M, at most n — 1 times; otherwise some one of the n — 1 admissible 
blocks c,A would appear twice. Following A’s last appearance we can continue 
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the restriction of inverses, each ergodic function w(r) is at least 0(r) and 
there exists an ergodic function (17) such that 


1 
(11.5) < < 000). 


n — 

nN — 
Let H, be a minimum r-covering and let H*, be the block obtained from 

H, by omitting the first r— 1 cymbols c,. The ray 

R = H,H*,H*;: - 


is admissible and transitive. We note in particular that H*, is preceded by 
¢' in R. For this ray ¢(1r) is at most the length of the block H,H*,: - - H*,, 
that is 

o(r) SY 


m=1 


Hence 


and (11.5) follows at once. 


We conclude this section by proving the following theorem. 


THEOREM 11.4. Jf N(1) is the number of admissible r-blocks under our 
general conditions of admissibility, there exists an ergodic function (1) such 
that 
(11.6) o(r) < (r+1)N(r), 


while each ergodic function is at least N(r) + r—1. 
We begin by proving the following. 
(a). There exists an r-covering of length less than (r+ 1)N(r). 


Statement (a) is true ifr—1. For if a,,: - -,d, 1s the set of generating 
symbols, there exists a 1-covering of the form @n-1@n provided the 
symbols e; are suitably chosen. This 1-covering has the length 2n—1 
<2N(1). We assume then that A, is an r-covering of length less than 
(r+1)N(r). We shall show how to form an (r+ 1)-covering of length 
less than (r + 2)N(r +1). 

There are N(r) distinct r-blocks in A,r and hence at least N(r) —1 
distinct (r + 1)-blocks. There remain at most 
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p—N(r-4+1)—N(r) +1 
(r+ 1)-blocks, say B,,- --,By, not in A. The block 


in which the symbols e; are chosen so as to make A,,, admissible, is an (7 + 1)- 


covering. Let s; be the length of A;. We see that 


Sp + (7+ 2) 


By virtue of the hypothesis that s- << (r+ 1)N(r) we infer that 


(11.7) Sra + +2) +1) +1] 
= +742. 
But for r > 1, 
N(r) 


so that (11.7) yields the relation 
Sra < (71+ 2)N(r+1), 


and the proof of (a) is complete. 


In proving (a) we have defined r-coverings A,r, r= 1, 2,- - -, such that 
A, is an initial block of A,,,. A ray whose first s, symbols are those of A, for 


each r will satisfy (11.6). The final statement of the theorem is immediate. 


12. Category and sets of transitive rays. In this section we shall 
determine the category of the set of transitive rays or of sets of relatively 
transitive rays. More particularly, we shall determine the category of various 
subsets of transitive rays defined by limitations on the ergodic functions. In 
this way we obtain an idea of the distribution of transitive rays. A similar 
use can be made of measure theory (cf. § 13), but the development of this 
phase of the theory is deferred to a later paper. 

In the category theory, the space used will be the space § of admissible 
rays with the Baire metric. Cf. §4. Recall that the space § is compact, 
_ perfect, and totally disconnected. Let H be a perfect subset of 8. A set A is 
of the first category relative to H if it is the sum of an at most enumerable 
set of nowhere dense (relative to H) subsets of H. These subsets will be 
called the components of A. The components of A may vary from the extreme 
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case where they are all null to the case where they are perfect. The com- 
plement H — A of a set A of the first category is termed a residual set of the 
second category. Such a set is everywhere dense in H and has the power of 
the continuum. Cf. Hobson [1], p. 132. 

Let & be a given ray. The rays obtained by deleting a finite initial block 
of R will be said to be based on RF, those obtained by the inverse process of 
adding a finite block to R will be termed extensions of R. The rays which are 
limit rays in the space S of the set of rays based on R will be called limit rays 
of R. A ray R which is a limit ray of itself will be said to be transitive relative 
to its limit rays. It is thereby necessary and sufficient that R be non-periodic 
and that every block of R appear infinitely many times in R. The limit rays 
of a relatively transitive ray R form a closed set H. This set is identical with 
its derived set since each ray based on F is a limit ray of the set. The set H 
is accordingly perfect. Concerning the set H we have the following theorem. 


THEOREM 12.1. In the set H of limit rays of a relatwely transitive ray 
R, the rays which are transitive relative to H form a residual set in H. 


Let A;,A:,- - - be an enumeration of the blocks of R. Let Ey be the 
subset of rays of H which contain no copy of An. The set #, is closed and 
contains no rays based on #. It is nowhere dense in H since the rays based on 
Ff are everywhere dense in H. Finally the set § of rays which are transitive 
relative to H is of the form 

S =H — 


and the theorem follows from the definition of a residual set in H. 

The above sets H, may all be null, in fact are all null if R is a subray 
of a non-periodic recurrent trajectory. In the case where F is transitive, that 
is transitive relative to the set of all admissible rays, the sets #, are never null 


as the following lemma shows. 


LemMA 12.1. The set K of rays which exclude an admissible r-block B 
for which r > 2 is closed, nowhere dense, and of the power of the continuum. 


It is clear that K is closed. That K is nowhere dense in S follows from 
the existence of a transitive ray R. For the rays based on R are everywhere 
dense in § and transitive. Since K is closed, it must then be nowhere dense. 

It remains to prove that K is of the power of the continuum. To that 
end we consider the admissible rays 


(12. 1) 2 or 3, 
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formed from the free symbols « and 8. This set is in one-to-one correspondence 
with the dyadic decimals 
* or 3, 


and hence is of the power of the continuum. 
Similarly, the set 


(12. 2) - -, m= 2 or 3, 


consists of admissible rays and is of the power of the continuum. 

A ray of the form (12.1) has no 3-blocks in common with a ray of the 
form (12.2). Since the length of B is at least 3, it cannot appear in a ray 
of (12.1) and a ray of (12.2). Hence at least one of the sets (12.1) and 
(12. 2) is contained in K. Thus K must be of the power of the continuum. 

The set of non-transitive rays is of the first category. But much more than 
this can be said, as the following theorem shows. 


THEOREM 12.2. The set & of non-transitive rays is everywhere dense in 
S and is the sum of countably many nowhere dense, closed sets, each of which 
is of the power of the continuum. 


That the set % is everywhere dense follows from the fact that an arbitrary 
block B and a sequence A of the symbols a can be combined in the form BeA 
to form a non-transitive ray. Rays of this form are everywhere dense in S. 
Let A,, As,: - - be an enumeration of admissible r-blocks with r > 2. The set 
EH, of rays which exclude A, is closed, nowhere dense and of the power of the 
continuum, in accordance with Lemma 12.1. The set & is the sum of the sets 
E,, and the proof is complete. 


THEOREM 12.3. If f(n) is an arbitrary positive function of the positwe 
integers n, the set F of transitive rays with ergodic function ¢(n) > f(n), for 
all values of n, has the power of the continuum. 


Let be an enumeration of admissible blocks consistent with 
their lengths. Let ;,72,- - - be an arbitrary sequence of positive integers and 
let ¢:c2° - - be an arbitrary sequence of the symbols « and B. We introduce 
the block 

B; = a’ A; (t= 1,2,- °°), 


choosing e; so that B; is admissible. 


We then introduce the ray 
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choosing the symbols b; so that the ray H is admissible. It is clear that F is 
transitive. 

Let ¢() be the ergodic function of #. We choose r, so that @(n) > f(n) 
for n= 1, 2,3. Let B be the initial f(4)-block of #. If rz is chosen suffi- 
ciently large, blocks A,, As,- - -, Ap entering into the definition of B will all 
be 1-blocks. In fact, if rz is sufficiently large only A, will enter into the 
definition of B. If A; is a 1-block, e;A,;b; is a 3-block so that B cannot contain 
With rz so chosen, (4) > f(4). 

Proceeding inductively with m > 2 we choose 7, so large that in the 
initial f(m + 2)-block C of #, at most (m— 1)-blocks A; are employed. The 
corresponding blocks e;A;b; are at most (m + 1)-blocks so that B”** does not 


appear in C. Hence 
(12. 3) + 2) > f(m-+ 2) (m = 3,4,- 


Thus ¢(m) > f(x) for all values of n. 

The above symbols c; can be chosen arbitrarily from the set a, B, the ri’s 
remaining fixed. For fixed 1;’s there are then as many different rays EF as 
there are different sequences ¢,¢2° - -, that is a number equal to the power of 
the continuum. 

The proof of the theorem is complete. 

The set F' of the theorem is not closed. The set of transitive rays not in 
F is open in the set of all transitive rays so that F is everywhere dense only 


if it includes all transitive rays. 


THEOREM 12.4. Given an arbitrary function f(n), the set w of transitwe 
rays R with ergodic functions d(n) Sf(n) for n exceeding some integer de- 
pendent on R is a set of the first category. 


Let Dm be the set of transitive rays for which ¢(n) Sf(n) for n ex- 
ceeding m. The set Dm is closed. But we have seen in Theorem 12. 2 that 
the set of intransitive rays is everywhere dense. Hence Dm is nowhere dense. 
jut = and the proof is complete. 

Let H(n) be a block of minimum length 6(n) covering all admissible 
n-blocks. We have termed 0(n) the n-th covering index. As we have seen in 
$11, there is in general no transitive ray with ergodic function ¢(n) = 6(n). 
We have however the following theorem. 


THEOREM 12.5. The set G of transitive rays with ergodic function $(n) 
such that 
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(12. 4) 


where 0(n) is the n-th covering index, is a residual set of the second category. 


We begin with a proof of the following. 
(a). The set G is not empty. 


To prove (a) let 8, be a sequence of positive constants converging to zero, 
We shall consider a ray of the form 


R= H (ro) e:H(1;)¢2H (12) 


where H(n) is defined as above and the symbols e; are chosen so that P is 
admissible. The integers 7; are chosen as follows. Let r>5 1. Proceeding 
inductively we suppose that rn, is defined and that the block 


H(19)e:H (11) (Tn-1) en 
has the length m,. We then choose 1, so large that 


(12. 5) + mn 


If ¢(n) is the ergodic function of the resulting ray FR, it is clear that 
S + Mn, 


since the initial block of length 6(7n) + mn in R contains H(rn) and hence 
all r'n-blocks. The function ¢(m) will satisfy (12.4) by virtue of (12. 5), and 
the proof of (a) is complete. 

If A is an arbitrary admissible r-block, the ray X = AeR is admissible if 
e is suitably chosen. Moreover the ray FR and the ray X have ergodic functions 
¢ and wy respectively such that 


O(n) Sy¥(n) So(m) +r+1. 


Hence X satisfies the condition (12.4) andisin G. We conclude that the rays 
of G are everywhere dense. 

To come to a proof of the theorem let m be a positive integer and p 4 
rational number > 1. The rays for which the transitivity index y(m) satisfies 
the condition 
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v(m) > 
form a closed set D,,”._ The intersection 


isclosed. The complement of G is the sum of the sets H as q ranges over all 
positive integers and p ranges over all rational numbers > 1. But G is every- 
where dense so that H” is nowhere dense, and the proof of the theorem is 
complete. 


THEOREM 12.6. The set Z of transitive rays with ergodic functions $(n) 
such that 
p(t) 
lim. inf.——- > 1 
9(n) 


is everywhere dense, has the power of the continuum, and 1s of the first category. 


The set Z includes the set of transitive rays for which ¢(n) > 20(n), 
and so has the power of the continuum in accordance with Theorem 12.3. The 
set Z is a subset of the complement of the residual set G of Theorem 12. 5 and 
so is of the first category. It remains to show that Z is everywhere dense. 

If F is a ray of Z, any ray FR’ based on FR is again transitive. If the 
initial symbol of FR’ is the (r + 1)-st symbol of R, the ergodic functions y(n) 
and ¢(n) of R’ and R, respectively, satisfy the conditions 

¥(n) 


W(n) p(n) ° > 1. 


Hence f’ belongs to Z. But the rays R’ based on the transitive ray PR are 


everywhere dense so that Z is everywhere dense as stated. 
The proof of the theorem is complete. 


13. Trajectories with the separation property. The results of this 
section will form the basis of a more extensive treatment to be presented later. 
No proofs will be given in this place. 

Let ¢ be a trajectory involving two generating symbols « and f. By the 
alength (B-length) of a block B of ¢ we shall mean the number of @’s (f’s) 
in B. The trajectory ¢ will be said to have the separation property and be of 
type § if the following holds. 


The a-lengths of any two n-blocks of t differ by at most 1. 
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The preceding condition implies the corresponding condition for B-lengths, 
Trajectories of type S arise in many ways. In particular let 


(13. 1) + f(x)y= 


be a differential equation in which f(x) is continuous and has the period 1, 
Let u(x) be a solution of (13.1) which vanishes at least once, but not identi- 
cally. We are concerned with the distribution of the zeros of w(x). ’ Without 
loss of generality we can suppose that none of these zeros occur at the points 
«=n, where n is an integer, positive, negative or zero. For the zeros of u(z) 
are countable, and for our purposes any transformation of the form =z +¢ 
is admissible. With this understood let a trajectory ¢ be defined as follows, 
Let each point on the z-axis at which u(x) = 0 be replaced by the symbol 4, 
and each point z = n be replaced by the symbol 8. The resulting trajectory t 
will be of type 8S. With trivial exceptions every trajectory of type S can be 
obtained in this way. The study of trajectories of type S is thus a study of 
the zeros of solutions of equations of the form (13.1). 

Let %n(Bn) denote the «-length (8-length) of some n-block of a trajectory 
t of type S. It can be shown that there exists a number 6, 0 = @ S 1, such that 

Bn 


& 
lim. — = 6, lim. — = 1— 6. 
noo 


The number 6 will be termed the rotation number of t. Cf. Poincaré [1]. 


The trajectory is not uniquely determined by the rotation number. If é 
is irrational, there are infinitely many trajectories of type S having 6 as rotation 
number and these trajectories are all recurrent. Thus trajectories of type § 
are in general recurrent. A simple exceptional example is the trajectory in 
which all the symbols are a except for one 8, in which case the rotation num- 
ber is 1. 

Given 6, 0 = 61, the recurrent trajectories of type S which have @ as 
rotation number form a minimal set which consists of a single periodic tra- 
jectory if and only if @ is rational. Since all of the trajectories of one of these 
minimal sets have the same recurrency function, it can be denoted by Ky(”). 
The behavior of Rg(n) varies widely with the choice of 9, but in all cases 


lim. inf. = 4. 


There exist irrational values of 6, and thus non-periodic recurrent tra- 
jectories such that 
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Ry (n 
lim. sup. =e) = 6. 6. 
n—>0o 
Qn the other hand, given an arbitrary function f(n), there exist irrational 
values of 6 such that Ry(n) > f(n) for infinitely many values of n. 
If certain exceptional 6-sets of (Lebesgue) measure zero are excluded, 

R,(n) displays great regularity in its asymptotic behavior. We can state that, 
except for a 6-set of measure zero, 


=-+ 0, 


lim. sup. 


fg 
n 
while if c > 1, except for a 6-set of measure zero, 


lim. sup. 4. 
n 

14. Typical open questions. Let F be a ray of the form coc,c.- - - and 
let ¢ be a trajectory containing F as a subray. Among the positive limit tra- 
jectories of ¢ are found one or more distinct minimal sets of trajectories. Cf. 
Theorem 6.1. Suppose v; of these sets are periodic trajectories, and vz of these 
sets general minimal sets. The number 1, is at most aleph null, and vz is at 
most aleph. That v. may equal aleph will be shown by an example in a later 
paper. The numbers v, and vz are uniquely determined by R and will be 
called the limit indices of R of the first and second types respectively. Our 
first question is as follows. 


(1). What are the limit indices of algebraic numbers, regarding decimal 
representations as rays? Are these rays ever transitive or recurrent? What 
are the answers to similar questions concerning m and e? 

(2). Let ¢ be a non-periodic recurrent trajectory. In all known examples 
R(n)/n fails to converge. Is this true in general ? 

(3). If ¢ is a non-periodic recurrent trajectory, R(n) = kn where k = 2. 
Cf. Theorem 7.5. What is the least value of k such that R(n) = kn for each 
hon-periodic recurrent trajectory ? 

(4). Inthe case of general admissibility certain blocks and finite sequences 
of blocks are prohibited. Knowing these prohibitions, is it possible to give a 
constructive rule for the formation of a minimum block covering all admissible 
t-blocks, and what is the most general such r-covering ? 

(5). Does there exist an ergodic function ¢(n) such that the ratio of 
$(n) to 6(n) tends to the limit 1? Cf. Theorem 12. 5. 
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(6). Is a theorem similar to that of Oldenburger true in case the gen- 
erating symbols suffer the restriction of inverses, or in the general case? 

(7). What is the most general minimal set with a given recurrency fune- 
tion K(n)? With permutation index P(n)? With P(n) and R(n) both 
given? These questions should be answered first when there are just two 
generating symbols. 


These questions seem to point to an algebraic analysis. Questions more 
intimately connected with measure theory, or the analysis of space forms will 
be presented in a later paper. 
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PERIODIC ORBITS IN THE RESTRICTED PROBLEM OF THREE 
BODIES IN THEIR RELATION TO HILL’S WORK 
ON THE MOTION OF THE MOON. 


By Eis StROMGREN. 


The first volume of the American Journal of Mathematics (1878) contains 
two papers by G. W. Hill that have become of fundamental importance in the 
theory of the motion of the moon, and which, besides, contain a few remarks 
that may well be looked upon as the starting point of important chapters of 
that special case of the problem of three bodies known as “The restricted 
problem of three bodies,” or “ Probléme restreint.” 

The Probleme restreint was first defined by Jacobi: Three bodies are given, 
two of which have finite masses and are moving around each other in circular 
orbits according to the laws of the problem of two bodies. The problem is to 
study the motion of the third body, assumed to have an infinitely small mass, 
when moving in the plane of the two finite masses under the influence of the 
attraction from these two masses. With regard to the mass ratio, it should be 
noticed that Poincaré in his investigations of the restricted problem always 
assumes that one of the two finite masses is very small relative to the other, 
thus reducing the problem to a perturbational problem, which can be solved 
with the aid of series expressions in powers of a small quantity, but which 
on the other hand enables us to study the nature of the motion in the neigh- 
bourhood of known exact solutions only. 

In the work carried out at the Copenhagen Observatory on the other 
hand, the two finite masses have been assumed to be equal, and the scope of 
the problem is essentially enlarged, it being desired to follow all investigated 
classes of orbits throughout the whole of their evolution. This problem, how- 
ever, not being a simple perturbational problem, could only be solved with 
the aid of numerical methods of integration. 


The cusps in Hill’s variational curves. Plate III in vol. 1 of the Ameri- 
can Journal of Mathematics gives (cf. Fig. 1 in this paper) a series of lunar 
orbits computed by Hill in the idealized variational problem, in which the 
parallactic factor is put equal to zero. In the diagram we see four variational 
curves from 1 to 4, of which the last has two symmetrical cusps, one above, 
and one below. 
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With regard to the orbits in a continued evolution of these variational 
curves Hill says as follows on p. 259 (Collected Works, I, 334) : “It is certain 
that the orbits, if they do exist, do not intersect the line of quadratures, and 
that the moons describing them would make oscillations to and fro, never 
departing as much as 90° from the point of conjunction or of opposition.” 
This was only a conjecture, and it was shown to be false by Poincaré in 
Méthodes Nouvelles, I, 106 ff. (1892). Here Poincaré shows by a simple 
mathematical procedure that the cusps in Hill’s variational curves in the 
further evolution simply become loops. Later (1907) in the Astronomische 


Fig. 1. 


Hill’s variational curves. 


Nachrichten No. 4155, the author of this article drew attention to the fact that, 
in the restricted problem, loops and cusps can exist at any point in the plane, 
with the exception of certain special points. With regard to these loops and 
cusps, the author found a general theorem, to which we shall return. 


Hill’s division of the plane of motion into permitted and forbidden 
regions. In the first of the two papers in the first volume of the American 
Journal of Mathematics quoted above, Hill, with the aid of the so-called 
Jacobian integral, makes his famous investigation of the permitted and the 
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forbidden regions in the plane of motion. The division is made with the aid 
of the so-called curves of zero velocity. These curves are shown in Fig. 2 for 
the mass ratio of the two finite masses m,; = 10m, used by G. H. Darwin in 
his “ Periodic orbits.” Plate I shows the curves for the mass ratio m, = mz 


¥ 


Fig. 2. 


Zero velocity-curves in the restricted problem of three bodies, with m, = 10m,. 


suggested by Thiele and adopted in the work at the Copenhagen Observatory 
(Plate I is based upon a drawing made by the Moscow astronomer N. Moisseiev. 
The scale and the values of the so-called Jacobian constant are the same as in 
the work at the Copenhagen Observatory, cf. for instance Publ. No. 100 of the 
Copenhagen Observatory, or Bulletin Astronomique, Deuxiéme Série, Tome IX, 
Fase. 2, 1935). In both of these drawings we have had cusps and loops drawn 
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especially, to illustrate the general theorem referred to above (Astronomische 
Nachrichten No. 4155): When, located on that part of the normal of a zero 
velocity curve which belongs to the permitted region, we look in the direction 
toward the curve,tthen the motion of the infinitely small mass in a cusp is 
always from right to left. As is seen from Fig. 2 and Plate I the direction 
of motion in a loop,also follows immediately from this theorem. It should be 
noticed, that in these diagrams, as in Fig. 1 and all the following diagrams, 
the X-axis is the line from m, to mz, which, in a fixed codrdinate system is 
rotating with uniform velocity. The 5 points of libration are denoted by 
Dy, Le, Ls, Ls, Ls in Fig. 2. 


The evolution of a cusp during the evolution of a class of orbits. The 
programme of the work at the Copenhagen Observatory originally was this: 
to find all classes of simply-periodic orbits around the finite masses and around 
the points of libration, and to follow these classes through the whole of their 
evolution. This programme has at the present time, with one single exception 
(cf. Publ. 100, pp. 26-27), been completely carried out. As the work pro- 
gressed, however, this programme was automatically considerably extended, 
as we were, during the investigations, led to study quite different possibilities 
of finding periodic solutions in the problem. 

Among the classes of orbits on our original programme was, amongst 
others, the following: It was clear a priori that at great distances from the 
origin orbits with slow, almost circular motion would exist, which in a fixed 
coordinate system were direct (motion in the same direction as the two finite 
masses), and—on account of the rotation of the m,m,-axis—rapid retrograde 
in the rotating coordinate system. Fig. 3 shows a number of orbits in this 
class. Far out we have an almost circular orbit (K = 14.4361). Then we see 
an orbit (K = 12.60816) of markedly oval shape, and still further inwards 
we encounter an orbit (K = 11.76718) with a marked indentation. This 
indentation develops in the evolution that follows into a cusp, and then into 
a loop. 

Fig. 4-7 show how the evolution is continued. The evolution ends in the 
point of libration L, with asymptotic motion into as well as out from this 
point of libration, on the ingoing branch with an infinite number of steadily 
decreasing windings, with the velocity converging toward zero, on the outgoing 
branch the reverse. Fig. 8 shows the ingoing branch only. 

This was an example of the asymptotic motions predicted by Poincaré in 
Méthodes Nouvelles on a theoretical basis, the asymptotic motion being here 


periodic also. 
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Fig. 8. 


The evolution of cusps and loops up to asymptotic motion. 
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Asymptotic orbits, that are also periodic, with an infinitely long period. 
We have seen how the class of orbits shown in Fig. 3, which begins at infinity, 
ends with a periodic orbit of infinite period, asymptotic to the two points of 
libration Z, and L;. This is a special case of a general theorem, that was 
deduced in an empirical way as a result of our investigations: It has been 
found that, as far as our experience goes, a class of periodic orbits always, 
either 


I) goes back into itself, so as to form a class closed in itself, 
or II) has a natural beginning and a natural ending. Such a natural be- 
ginning or natural ending always takes place: 
either a) in one of the two finite masses, or in both, 
or b) in one of the 5 libration points, or in two of them, 
or c) at infinity. 


g 
y 
Mo 

PA 
x A 
Fig. 9. 


It now turned out that the study of the class of orbits shown in Fig. 3 
had very extensive consequences. We see how an asymptotic-periodic orbit acts 
as the boundary orbit of a class of periodic orbits. The following idea was 
now rather obvious: If we can find other asymptotic orbits, which are also 
periodic (with infinitely long period), will it not then turn out that these 
orbits are boundary orbits of classes of periodic orbits? 

The way that had to be followed now was very simple. We computed 
various orbits, that were asymptotic from the point of libration Ly. 

We started the calculations with the aid of the theoretical expressions for 
the motion infinitely close to the libration point. 

Fig. 9 defines the problem in the fixed codrdinate system with center of 
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gravity of m, and mz at the origin. The differential equations of the motion 
of the infinitely small mass are, in the fixed codrdinate system, and with our 
choice of units, as follows: 


d*x \.2+cost , | 
dt? ) + 
ytsint y—sint \ 
= (x + cost)? + (y+ sin t)? 
= (4 — cost)? + (y—sint)?. 


Referred to the codrdinate system rotating with the line m,mz2 the equa- 


tions are: 


dt? dt p2® 
d*y dé 
(2) 4 dt? + dt p2® 


pi == (€ 4-1)? 
= (€— 1)? + 


If we transfer the origin of the codrdinate system to DL, and neglect 
powers of the codrdinates higher than the first we obtain the following equa- 


tions of motion: 


3 
dt? 

(3) dy ,,d¢_ 9 
ae ta 


which are, therefore, the differential equations of the infinitesimal motion near 
L,. The solution for the ingoing motion is given in terms of the functions 


et sin bt and cos bt, 


that for the outgoing motion in terms of 


e*4t sin bt and e*#* cos bt, 


i.e. we have an asymptotic motion. 

We follow the motion with the aid of the infinitesimal equations up to 4 
point, which is very close to L,, and then continue the solution with the aid 
of numerical integration of the exact differential equations (2). 
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If we follow the motion until the orbit cuts the X-axis for the first time, 
we shall, in case the orbit should be perpendicular to the X-axis, have a 
periodic orbit—due to the symmetry of the differential equations. The orbit 
will continue symmetrically with regard to the X-axis, and asymptotically 
approach L;. We may imagine the orbit continued in an orbit symmetrical 
with respect to the Y-axis, back to L4, where the motion started. 

The problem of finding such orbits, which are perpendicular to the X-axis, 
presents no difficulties. The method is to compute a number of orbits that 
eut the X-axis under various angles, interpolating finally between such of 
these orbits as make an angle of 90° with the X-axis. 


G 


Fig. 10. 


The 5 asymptotic-periodic orbits which are already perpendicular to the «-axis 
after a quarter of a revolution. 


We have in Copenhagen made a large number of such calculations. Plate 
II gives the essential of the results obtained. Examining this drawing we find 
that 5 orbits exist, not more and not less, which are already perpendicular to 
the X-axis after a quarter of a revolution. Fig. 10 shows these five asymptotic- 
periodic orbits (for the sake of simplicity only about half of the orbits have 
been reproduced). 

The question now arises, what does it mean that, in this way, a number 
of orbits exist, that are periodic (with infinitely long periods) and at the same 
time asymptotic to and from L, and L;? Now, if for instance we investigate 


| 
» 


876 ELIS STROMGREN. 


Fig. 11. 


Two asymptotic orbits as boundary orbits of a class of periodic orbits. 


PERIODIC ORBITS IN THE RESTRICTED PROBLEM OF THREE BODIES. 
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orbit V in Fig. 10, we find that this orbit is identical with the boundary orbit 
that was found during the study of the class of orbits mentioned before, which 
had its origin at infinity. 

A more detailed investigation showed that the four other orbits are also 
boundary orbits of classes of periodic orbits. Thus we have for instance a 
class of periodic orbits the boundary orbits of which are the orbits III and IV 
in Fig. 10. Fig. 11 reproduces these two boundary orbits drawn completely, 
In between these two boundary orbits we have to imagine a class of orbits— 
infinite in number—the boundary orbits of which are the two asymptotic orbits, 

Herewith we have been led to a quite new principle of finding classes of 
periodic orbits. Formerly the starting point was those periodic orbits, which 
were known to exist near the masses or the libration points, and through 
laborious numerical calculations the evolution of each separate class was fol- 
lowed, until the whole evolution of the class up to the natural boundary curves 
had been covered. Now we are faced with the inverse problem: Once we have 
found an asymptotic orbit, which is also periodic (with infinitely long period), 
then we also have a boundary orbit for a whole class of periodic orbits. 

We shall not enter into further details, but only touch upon one further 
point. We found the 5 boundary orbits in Fig. 10 by looking for asymptotic 
orbits which cut the X-axis at right angles. Of course we can also investigate 
the problem of finding orbits asymptotic to Z4, which do not cut the X-axis 
at right angles, but do intersect the Y-axis perpendicularly, one quadrant 
later. We shall have to deal, in that case, with orbits that are not—like 
asymptotic to L;, but which, having cut the Y-axis, 


those 5 mentioned before 
continue in a branch symmetrical to the first with respect to the Y-axis, and 
end in the libration point LZ,, where they started. Our investigations now 
showed that 6 such orbits exist. They can be found on Plate II, and they 
are all given together on Plate III (No. 2, 3, 4, 5, 6, and 9). All of these 
orbits are boundary orbits of classes of periodic orbits, and of course the 
reasoning can be extended: We can look for all orbits, asymptotic out of Li, 
which on cutting the X-axis for the second time are perpendicular to it, and 
orbits, which on cutting the Y-axis for the second time are at right angles 
to it, etc., to infinity. In this way we will get boundary orbits of an in- 
finitely increasing number of classes of more and more complicated periodic 


orbits. 


Hill’s investigations of the variational curve of the moon and of the 
problem of the curves of zero velocity introduced new principles into lunar 
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theory. Poincaré showed the importance of periodic orbits and proved the 
existence of asymptotic orbits in the problem of three bodies. Thiele-Burrau 
and G. H. Darwin introduced the methods of numerical integration in the 
investigations of the restricted problem. In these facts we have the origin 
of the work on the restricted problem that has been carried out at the Copen- 
hagen Observatory. 


ASTRONOMISK OBSERVATORIUM, 
K¢BENHAVN. 
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ZUR THEORIE DER DIRICHLETSCHEN REIHEN. 


Von Orto TorEPLitz. 


Die mathematischen Gedanken, die G. W. Hill in seiner beriihmten Arbeit 
iiber das Mondperigium entwickelt hat, sind eines der Hinfallstore geworden, 
durch die man den Zugang zum Bereich der unendlich vielen Verinderlichen 
gefunden hat. Von dieser Arbeit Hills angeregt, hat H. Poincaré die all- 
gemeinen Sitze tiber unendliche Determinanten ‘entdeckt, die H. von Koch 
zu einer geschlossenen Theorie abgerundet hat und die I. Fredholm als ein 
Vorbild bei seiner Konzeption der allgemeinen Theorie der linearen Integral- 
gleichungen dienten. Der Allgemeinheitsgrad dieser Sitze, der in dieser 
historischen Entwicklung als ein entscheidendes Moment erscheint, ist bei 
Hill explizite nicht anzutreffen; er handelt scheinbar von einem speziellen 
Gleichungssystem. Das ist nur scheinbar der Fall. In Wahrheit hat er als 
ein genialer numerischer Rechner aus den Zahlen seines Spezialfalles die 
allgemeinen Gedanken herausschauen lassen, die er an der Hand des Sonder- 
falls erkannte, und vielleicht sind diese Gedanken von der Theorie der unend- 
lich vielen Variablen noch heute nicht voll ausgeschépft worden. 

Obgleich die vorliegende Arbeit zu der oben beriihrten Aufgabe keinen 
direkten Bezug hat, soll sie doch Zeugnis ablegen von dem Andenken, das die 
Lehre von den unendlich vielen Variablen dem Geiste Hills schuldet. 


Die Arbeit ist der unveriinderte Abdruck von einem Teil eines Manuskripts. das 
im Jahre 1918 niedergeschrieben worden ist, in den Jahren 1918-1922 einer Reihe von 
Fachgenossen vorlag und im mathematischen Seminar der Hamburger Universitit 
referiert worden ist. Herr A. Wintner, dem ich fiir die Anregung zum Abdruck und 
fiir die Besorgung der Redaktion besonderen Dank schulde, hat dieses Manuskript im 
September 1929 kennen gelernt. 


1. Die D-Formen. Es sei f(t) eine fiir <t< +o erkliarte 
Funktion, fiir welche alle Hadamardschen Koeffizienten 1 
existieren, wobei Mt? die Operation der Mittelbildung bezeichnet: 
M{- - -}— lim — dt, 
W=+00 ® -w 
Und zwar soll f(ti) derart sein, dass die Hadamardschen Koeffizienten ¢(A) 
ausser fiir lg 2, lg 3,- - - verschwinden. Der Koeffizient c(lg®) 


1J. Hadamard, Acta Mathematica, Bd. 22 (1899), insh. S. 60-63. 
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werde kurz als cy bezeichnet, also insbesondere c(0) als c,. Speziall wird also 
nach Hadamard und Schnee eine solche Belegung stets dann vorliegen, wenn 
man eine spezielle Dirichletsche Reihe Sc,/n* hat, die auf s = tt absolut oder 
iiber s = 7% hinaus bedingt konvergiert, und wenn man ihren Summenwert 
f(tt) als Belegung nimmt. 

Alsdann betrachte man den nicht-negativen Ausdruck ? 


= (2,0, + + + + +--+) 
+ + + +++) 2-4 
+ + (%) + Hann", 


multipliziere ihn mit f(t7) und bilde den Integralmittelwert nach ¢: 
(2) Dn(a, &) f(t) (4, + (a, + ++ }. 
Nun ist vorausgesetzt, dass 

M (f(t) n**) cy, oder 0, 


je nachdem n positiv-ganz ist oder nicht. Fiihrt man also in (2) unter dem 
Integralzeichen statt der linken Seite von (1) die rechte Seite ein, so ver- 
schwindet die Mehrzahl der Summanden, und es bleibt: 


Ist nun iiberdies die Belegung f(ti) beschrinkt, | f(t7)| M, so kann 
man auf das Integral in (2) den ersten Mittelwertsatz der Integralgleichung 
anwenden, da der Faktor von f(t) eine reelle, nicht-negative Funktion ist, 
und erhialt 
also unter Benutzung der Relation (1) : 

da Mt{e**} verschwindet oder gleich eins wird, je nachdem A ~ 0 oder A = 0 ist. 

THEOREM I. Befriedigt eine Belegung ausser den eingangs dieses Para- 
f(t)| SM, und ist 
H, das Maximum des Ausdrucks (3) unter der Nebenbedingung x,%,+°-- - 


+ = 1, so ist Hn S M fiir jedes n. 


graphen aufgefiihrten Voraussetzungen noch die weitere 


* Hierbei sind die Summen in den Klammern alle soweit zu erstrecken, als die 
Indizes =n bleiben. 


elt 
en, 
1en 
il]- 
ch 
‘al- 
ser 
bei 
len 
als 
lie 
er- 
en 
lie 
las 
‘OD 
tit 
ind 
im 
rte 


882 OTTO TOEPLITZ. 


BEMERKUNG. Bedient man sich der Sprechweise der Theorie der unend- 
lich vielen Variablen, so ist D,(a, Z) nichts anderes als der n-te Abschnitt der 
Bilinearform von unendlich vielen Variablen 


+ + +° 

nur dass in den n-ten Abschnitt D,(z, y) hiervon noch y; = %,° + +, Yn = Gn 
gesetzt ist. Nun besagt ein einfacher Hilfssatz* iiber Bilinearformen von 
endlich vielen Veranderlichen A(x, y), dass das Maximum von | A(a, y)| fiir 
Sala = 1, = 1 hochstens gleich dem Doppelten des Maximums von 
A(a,) fiir ist; also liegt im vorliegenden Falle | y)| unter 
2M, und Theorem I kann auch so formuliert werden: 

Befriedigt eine Belegung die eingangs dieses Paragraphen aufgefiihrten 
Voraussetzungen, so soll die aus ihren Hadamardschen Koeffizienten c, auf- 
gebaute Bilinearform D(z,y) als die zu der Belegung gehorige ,D-Form" 
bezeichnet werden. Ist die Belegung beschrankt, | f(ti)| = M, so ist die 
D-Form im Hilbertschen Sinne beschrankt, und thre obere Schranke liegt 
unter 2M. 

Die Koeffizientenmatrix einer solchen D-Form hat die Eigentiimlichkeit, 
dass die Stellen, die den nimlichen Koeffizienten tragen, je eine gerade Linie 
erfiillen und dass alle diese geraden Linien ein Strahlenbiischel bilden mit 


endlichem Zentrum : * 


Ci Co Cg Cy Cy Cg Cr Cg Co Ciz2* * 
ea 0 0 a 80 & Ce 
60 8 8 8 | 
0000 464 000 0 0 | 
10 0 000 000 0 0 

© 000004 0 0 0 
000 00000 04 0 0 
0000000000 


| 


8 Vel. eine inzwischen in der Mathematischen Zeitschrift (Bd. 2 (1918), S. 187) 


erschienene Arbeit. 
4 Wihrend sie bei den L-Formen Parallelstrahlenbiischel bilden. 
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Das System dieser D-Formen oder D-Matrizen ist isomorph den rein 
formal betrachteten zugehérigen Dirichletschen Reihen, d.h. der Summe und 
dem formalen Produkt zweier spezieller Dirichletscher Reihen gehért die Summe 
baw. das Produkt der zugehérigen D-Formen (im Sinne des Matrizenkalkiils) zu. 

Nach einem bekannten Hilfssatze von Weierstrass kann man aus der 
Positivitait des Faktors von f(ti) in (2) mehr entnehmen: 


Zusatz. Ist & der kleinste konvexe Bereich, der alle Werte der Belegung 
f(t) umschliesst, so liegt jeder Wert, den D,(x,%) fiir 1 annimmt, 
in 8. 

2. Umkehrung von Theorem I fiir absolut konvergente Dirichletsche 
Reihen. Ist f(s) = Sc,/n* absolut konvergent fiir s = ti, ist m. a. W. & | ¢n | 
konvergent, so ist Scn/n* a fortiori gleichmiassig konvergent, und es folgt aus 
den Ergebnissen von Hadamard, dass f(#) alle Voraussetzungen von § 1, auch 
die der Beschranktheit, erfillt. Es gilt dann: 


THEOREM II. Ist 3 | cn | konvergent und wird 3cn/n** gesetat, so kommt 
der in Theorem I mit Dy (x, bezerchnete Ausdruck unter der Nebenbedingung 
Wala = 1 und bei hinreichend grossem n jedem Wert der Belegung f (tt) 
beliebig nahe. 

Der Beweis werde mit dem naheliegenden Versuch begonnen 
(1) = 1/ Va, aq =1/Va, alle weiteren = 0. 


Dann ist, wenn n = a, 2,2, -+2,%,—1. Ferner kommt in D,(a, 
offenbar c, genau « mal vor, jedesmal mit dem Faktor 1/Va-1/Va=1/a, 
das gibt zusammen ¢,, wihrend c, genau 4% mal vorkommt, wenn @ gerade ist, 
und $(a—-1) mal bei ungeradem a, u.s. f.; also ist fiir das Wertsystem (1) 


1 
D,{2, = + 4¢3 + +2 oat Aa: 


wo die kleine Abweichung Ag jedenfalls der Abschatzung unterliegt 


| @ 


20 a 


| S| ce] 


Wegen der Konvergenz von & | ¢q | ist lim | ¢a | = 0, also nach dem Satz vom 
X 


a 
arithmetischen Mittel auch lim A, und D,(2,#) nahert sich fiir das 
a=o 


Wertsystem (1) und mit wachsendem « und n =a immer mehr dem Werte 
(2) + $¢.+ (1). 
So kénnte man von f(1 + i), wie eben fiir f(1), dasjenige einsehen, was von 
f(0) und f(0 + ti) gezeigt werden soll. 

Ks soll deshalb der Versuch mit anderen, komplizierteren Wertsystemen xp 
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wiederholt werden, die iibrigens den Vorteil haben werden, dass die lastige 
Abschitzung eines A, nicht auftritt. Es sei zuerst, mit | y|—=1, 
alle anderen zy = 0. 
So erhalt man neben >) 2%, = 1, wenn n hinreichend gross ist, n = 2+, 
y=1 
—1 —2 1 


oder, wenn = gesetzt wird, 


a—l1 a—2 Cy, 1 1 


d. h. das a-te arithmetische Mittel der unendlichen Reihe 
Cc 
(6) + * 


Da diese eine Teilreihe der als absolut konvergent vorausgesetzten Reihe 
Xcn/n"* ist, ist sie auch konvergent, und somit auch die Folge ihrer arith- 
metischen Mittel, und diese konvergiert gegen denselben Wert. Dem Wert 
dieser Teilreihe kommt aber zunichst die Form D, (2, 7) im angegebenen Sinne 
beliebig nahe. 

Ks sei nun weiter bei hinreichend hohem n (n = 24: 38): 


| | =1, | | 
und 
2 a-1 
V a8 Va V ap V ap 
2 a-1 
(7) Vag V ap V ap V ap 
p-1 p-1 p-1,2 p-1,a-1 
V ap Va V ap 
alle anderen zy = 0, so ist wiederum Sav%,—1, und D,(x,Z) wird gleich 
_(«#—1)8 _, (7 — 2) 
C20 + + C3. ap 
—l1 
+. 
+ 8-8-1 — 4. ¢,8-1 + 638-1, - 
aB 
1 


+ + 
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Nun ist die Doppelreihe 


Cy + + + - 
+ + + + - 
+ + + + - 


(9) 


absolut konvergent als Teilreihe der absolut konvergenten Reihe 3¢,/n**. Setzt 
man also 6 = = so kann man zuerst bei festem den Index « so 
gross wihlen, dass (8) beliebig wenig abweicht von 


+[(pB— 1) /B | {c,(1/3**) + C3.2(1/6"*) + C3,4(1/12**) +: 


d.h. von dem f-ten arithmetischen Mittel aus den Zeilensummen der Doppel- 


relhe (9). Endlich kann man also ~ so gross werden lassen, dass man den 
Summenwert der gesamten Doppelreihe (9) beliebig nahe kommt. Der Wert 
von (9) wird somit durch Werte von D,(x,Z) bei hinreichend hohem n 
approximiert ; (9) ist aber schon eine umfassendere Teilreihe von 3¢n/n** als (6). 

Setzt man allgemeiner, indem man mit py die v-te Primzahl bezeichnet, 


|m |= 1 fir 


alle anderen a — 0, so folgt durch eine analoge Schlussweise, dass man mit 


co 

Dn(«,%) bei hinreichend hohem n diejenige Teilreihe von > ¢n/n*+ approxi- 
n=1 

miert, deren Glieder einen nur durch p,,° - -, pv teilbaren Index n haben. 


Nun ist die ganze Reihe absolut konvergent; man kann also N so gross wahlen, 
dass > | cn/ntt | S de ist. Alsdann wahle man y so gross, dass alle in den 


n=N+1 


Zahlen 1,2,3,: - -,N aufgehenden Primfaktoren unter - pv enthalten 
sind. Dann erhalt man als Wert von D(a, @) eine Teilreihe, die zunachst alle 


N 

Glieder von S\c,/n‘+ enthalt und dann noch manche von dem Rest; die 
n=1 

letzteren zusammen aber betragen weniger als $e; also ist > cn/n** von einem 


n=1 
Wert von D,(z,Z) um weniger als e entfernt, q. e. d. 
Die Gréssen H,, H2,- - - bilden eine Folge zunehmender, positiver Zahlen ; 
nach Theorem I liegen sie alle unter M; also haben sie einen Grenzwert 
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lim H, = H = M. Theorem II gestattet nun, wenn M die genaue obere Grenze 
nN=CO 
von | f(t)| ist, zu folgern, dass H = M ist: 

FoucEruNG. Ist M dte obere Grenze des Betrages von Xc,/nt = f (ti) 
und lim H, = H, so ist H= M. 


n=OO 


38. Das Vertikalenmaximum. Wie bisher aufgestellten Theoreme ges- 
tatten bereits einige Anwendungen auf die Theorie der Dirichletschen Reihen, 


DEFINITION. Ich sage, eine Belegung ¢(t), —» <t< + o, nimmt 
einen Wert ,asymptotisch“” an, wenn sie ihm ausserhalb jedes Intervalls 
—TS=t=+T beliebig nahe kommt. Im gleichen Sinne spreche ich von 
dem ,asymptotischen Maximum” yon | ¢(¢)|, von dem ,asymptotischen Wert- 
vorrat” u. Ww. 

Auf Grund dieser Terminologie kann man eine Verschirfung, deren 
Theorem I fahig ist, folgendermassen formulieren: 


THEOREM III. Ist f(ti) eine Belegung, die den Voraussetzungen am 
Eingang von §1 geniigt, ist M das asymptotische Maximum von | f(ti)| und 
H = lim H, dieselbe Grosse wie in Theorem I, so tse HIE MEM. 


Zusatz. Ist K der kleinste konvexe Bereich, der den asymptotischen 
Wertvorrat von f(ti) enthalt, so legen alle Werte, die D, (a, fiir Stat =! 
annummt, in K (das ganz in & enthalten ist). 


Beweis. Ist M das asymptotische Maximum von | f(t)|, so besagt 
dies: wenn e >0 beliebig vorgegeben ist, kann man ein solches Interval 
—T=t=+T finden, dass die obere Grenze von | f(ti)| ausserhalb dieses 
Intervalls < M+ ist. Betrachtet man nun neben der Belegung f(/i) die 
andere g(ti), die fir —T=t=+T den Wert 0 hat, ausserhalb dieses 
Intervalls aber mit f(t7) iibereinstimmt, so ist die obere Grenze von | g(ti)| 
=M-+e.. Dann existieren aber die Hadamardschen Koeffizienten von q(t?) 
und sind genau dieselben wie die von f (ti) ; es wird also auch zu g(ti) dieselbe 
D-Form gehoren, wie zu f(t), mit derselben oberen Grenze H. Wendet man 
also Theorem I auf g(ti) an, so erhilt man HS M+, und da e beliebig 
klein gewihlt werden konnte, HS M. 

Nun ist unter den Voraussetzungen von Theorem I] H=M; nach 
Theorem III ist HSM, und seiner Definition nach ist MX M; also ist 
auch H= 


Sarz 1. Im Gebiete absoluter Konvergenz ist fiir eine Dirichletsche Reihe 
f(s) = lings jeder Vertikalen die obere Grenze von 


(ti 


1en, 


alls 
von 


ren 
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| f(o + t)| mit dem asymptotischen Maximum M(c) dieser Belegung iden- 
tisch. Genauer liegen alle Werte von f(o-+ ti) innerhalb des kleinsten kon- 
veren Bereiches K(o), den man um den asymptotischen Wertvorrat von 


t) legen kann. 


Dieser Satz enthilt als Folgerungen einige Theoreme, die Herr Bohr 1910 
bewiesen hat. Fiir £(s) = 31/n® namlich ist #(s) > 1 das Gebiet absoluter 
Konvergenz, und da die Koeffizienten reell-positiv sind, ist | (0 + ti)| S£(e), 
also M (co) =£(c). Satz 1 zeigt nun, dass (0 4+ tt) fiir beliebig hohe ¢ dem 
Wert {(o) beliebig nahe kommt. Da nun ¢(c) unbegrenzt wiichst, wenn o 
von rechts an 1 heranriickt, ist also (s) in der Halbebene #(s) >1 un- 
beschrinkt. Entsprechendes gilt fiir jede Reihe 3c,/n* mit lauter nicht- 


negativen reellen Koeffizienten. 


Satz 2. Innerhalb des Gebietes absoluter Konvergenz ist das Vertikalen- 
mazimum M(o) eine nirgends ansteigende Funktion der Vertikalenabscisse o. 


BeweEis. Das Vertikalenmaximum M(o) ist nach dem Ende von § 2 
identisch mit der oberen Grenze H(o) der D-Form mit den Koeffizienten c,/n?: 


fir S¢gZ,— 1. Nun geht diese Form durch die Substitution t, = n-, in 
D(é,é;0 + 2e) iiber, d.h. in die D-Form zu der um 2e weiter nach rechts 
gelegenen Vertikalen. Es ist Sarin = Sn-EnEn. Ist also SénEn S 1, so ist 
erst recht San%» aber nicht umgekehrt. Ist also | D(z, <; a) | fiir 1 
unter M(o) gelegen, so wird a fortiori | D(é,é,o + 2e)| fiir 3&2, S 1 unter 


M(c) liegen, d.h. es ist M(o + 2) = M(c). 


4. Umkehrung von Theorem I ohne einschrankende Annahme. Ks sei 
eine D-Form D(a,y) rein formal vorgelegt, und es sei lediglich von ihr 
vorausgesetzt, dass sie beschrankt ist. Daraus folgt an sich nach den allerersten 
Saitzen der Theorie der unendlich vielen Verainderlichen, dass die Quadratsumme 
|? der Betriige ihrer Koeffizienten konvergiert; es sei aber keinerlei 
weitere Annahme, wie z.B. die der Konvergenz von hinzugefiigt. 
Alsdann kann man sich, ankniipfend an die Betrachtungen von § 2, derselben 
Methode bedienen, die Herr Bohr,> auf anderen Voraussetzungen fussend, 


m einem ihnlichen Ziel benutzt hat. 
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Zuerst folgt aus der im §1 im Anschluss an Theorem I gemachten Be- 
merkung, dass alle | c, | = 2H sind, wo H die obere Grenze von | D(z, Z)| fiir 
SVa%q = 1 ist. Sei nun, mit Herrn Bohr, zunichst rein formal: 


= 2 C2" ? f2(s) =D D 
n=1 2 2 1 3 = 
co 1 


so konvergiert also f(s) fiir R(s) >0 absolut und stellt daselbst eine 
analytische Funktion dar, ebenso f2(s),fs(s),° °°. 

Ferner folgt aus der schon erwihnten Konvergenz von & | cn |* mit Hilfe 
der Schwarzschen Ungleichung, dass & | Cn |/n#*¢ fiir jedes « > 0 konvergiert, 
dass also Sc,/n* fiir #(s) > 4 absolut konvergiert. 

Endlich besagen (5) und (6) von § 2, dass das a-te arithmetische Mittel 
der Reihe f,(s) fiir st unter H gelegen ist; also ist | f1(s)| SH fiir 
R(s) > 0, und ebenso folgt aus § 2, dass | fy(s)| S H fiir R(s) > 0 und jedes 

Ist nun 1-+ 71 irgendein Punkt auf der Vertikalen mit der Abszisse 1, 
so betrachte man einen Kreis vom Radius 3 um diesen Punkt. Innerhalb 
dieses Kreises ist Sc,/n* absolut konvergent; f,(s), fe(s),- sind Teilreihen 
davon; dieselben konvergieren also gleichmassig gegen f(s). Nun besagt ein 
von Stieltjes stammender funktionentheoretischer Hilfssatz: wenn eine Folge 
analytischer Funktionen /;(s), f2(s),: in einem Kreise gleichmissig gegen 
f(s) konvergiert und wenn dieselben Funktionen fy(s) in einem grdésseren 
Gebiet dem Betrage nach unter einer gemeinsamen festen Grenze liegen, s0 
konvergieren sie in jedem beschrinkten abgeschossenen Teilbereich dieses 
groésseren Gebietes gleichmissig. Im vorliegenden Falle ergibt sich daraus, 
dass die obigen Funktionen f,(s) auch im Kreise vom Radius 1 um den Punkt 
1 + +i gleichmissig konvergieren ; die Grenzfunktion wird daselbst die analy- 
tische Fortsetzung von f(s) sein und dem Betrage nach unter H liegen. 


THEOREM IV. Ist die zur Reihe Xc,,/n* gehérige D-Form im Hilbertschen 
Sinne beschrankt und H die obere Grenze von D(a, @), so konvergiert die Rethe 
selbst absolut fiir R(s) >4 und stellt eine analytische Funktion f(s) dar, 
die bis an R(s) = 0 heran fortsetzbar ist und dem Betrage nach iiberall unter 
H liegt. 


Bonn, GERMANY. 


5H. Bohr, Géttinger Nachrichten, 1913. 


Be- 


| fiir 


MEAN MOTION. 


By HERMANN WEYL. 


The problem of mean motion which arose from the theory of the secular 
perturbations of the planetary orbits [1] and which will here find its complete 
solution, deals with the superposition of a finite number of epicycles or of 
simple oscillations : 


(1) axe (9x), 
(2) dy. Di. (4) == 


We use the abbreviation e?™ — e(#). The amplitudes a, > 0, frequencies dx, 
and initial phases a, are real constants; the dime ¢ is a real variable ranging 
from — «© to + o. The n-dimensional space of the real phases #; is to be 
construed as a torus © by identifying two points (0) = (%,---,%n) and 


= -, Wn) which are congruent modulo 1: 
= 0, (mod 1). 


zis first a function of the n phases %, and then, after the substitution (2), 
4 function of ¢. With the customary notation, polar codrdinates are used in 
the z-plane: 


g==r-e(d). 
Let us begin by discussing the phase function. (1) may be interpreted 
as depicting a (plane open) linkage AjA,-: - : An consisting of the links 
(3) Ay = 


of fixed lengths a. z= AoA» is the vector spanned by the linkage. Those 
points on the torus @ for which z = 0 form what we call the singular mani- 
fold; it consists of all possible states of a closed n-linkage with the prescribed 
sides a. HH being any area in the z-plane, the volume v(H) of that part of 
the torus where z(#,- - - 9) takes on values in FH, indicates the phase proba- 
bility of z lying in #. On account of central symmetry, it suffices to determine 


the probability 


W(r) = W(r;a,° dn) 


of the span z being in absolute value <7. By its very nature W(r) is an 
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increasing function of r varying between the limits 0 and 1. The strip 
0= W=1 in a (W, ¢)-plane is a cylinder when points (W,¢), (W, ¢’) are 
regarded equal if 6=¢’ (mod1). The phase probability v(#) is the area 
of the image of EF on the (W, #)-cylinder produced by the mapping 


a—r-e(p) > (W(r),¢). 


A straightforward analysis by Fourier transforms, carried out by A. Wintner 
[2], results, for n = 2, in the absolutely convergent expression 


in terms of the Bessel functions J,, J;. The relationship to the random walk 
problem is obvious [3]. 

We now turn to the second standpoint with ¢ as the independent variable, 
and assume that the A satisfy no homogeneous linear relation 


with integral coefficients m,. The law of equidistribution (Kronecker-Weyl) 
[4] then states that the straight line (2) fills the torus with uniform density. 
In other words, for any piece G of the torus with a Jordan volume G, the 
relative frequency with which the point #(t), moving uniformly along a 
straight line, visits G, tends to @ if the period of observation becomes in- 
finitely large, or: time-probability = phase-probability. A slightly more general 
form of the proposition asserts that for any bounded Riemann integrable func- 


(6) J, OM) at tends to dd, 


with t,—t,— «, or that the time-average of f equals its phase-average. 
Hence the considerations of the preceding paragraph determine the relative 
frequencies with which the “ planet” 2(#) visits the various parts EH of the 
z-plane.* 

However, the problem of mean motion is slightly more intricate. To 
avoid ambiguities, let us suppose that the planet never passes through the 
origin (or that the straight line 3(¢) does not hit the singular manifold). 
Then the continuous increment ¢(/2) —#(t,) of the azimuth ¢(t) of 2(t) 


1 This is to be interpreted in a figurative sense, as in the astronomical application 
based on the Copernican rather than the Ptolomaic system ¢(t) is not the longitude 
of the planet itself, but of its perihelion or its ascending node. 
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over any time interval ¢; t=, is uniquely determined. @ has a mean 
motion p if 


$(t2) — 
to — ty 


for t,—t,> 0. 


We shall see that such a mean motion yp exists, depending linearly on the 
frequencies Aj, : 


(7) Wir -f- Was. 
The complete statement is as follows. 


THEOREM. Assuming the frequencies rx to satisfy no linear relation (5) 
with integral coefficients m, of sum 0, the azimuth of the superposition (1), 
(2) of epicycles has a mean motion 


The coefficients W;, depend on the amplitudes a,,- - -, an only; they are = 0 
and their sum equals 1, so that (7) 1s a certain average of the indwidual 
frequencies Ay. More precisely, W;, is the probability 


(8) Wi = W (Ge; On * * Ax-1) 


that an (n—1)-linkage with the given sides * Spans a 
distance < Ay. 

For the proof we first return to our old hypothesis that the A, satisfy no 
linear homogeneous integral relation whatsoever, and then follow a method 
recently suggested by P. Hartmann, E. R. van Kampen, and A. Wintner [5]. 
We verify at once that the time derivative 


(t) ) {It = real part} 
arises by the substitution (2) from the following phase function: 
(dx) 
9 
Saxe (Px) 


which becomes dangerously infinite on the singular manifold. Were one 
allowed to apply to this function the principle of equidistribution (6) holding 
for bounded Riemann integrable functions f, one would at once obtain a mean 
motion 
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The y» as defined by this formula depends linearly on the Ax, (7), and its coeffi- 
cients W; are integrals of the following type: 


So far HKW. I shall now show how to carry out the integration by 3, in W,. 
Indeed, for fixed #,,- - -,%n-; and variable 3, —# we set 


n-1 


an = A, > axe (Ix) = b, 
k=1 


2—=2(0) =b+a-e(9). 
The integrand is then 


1 2 
5 
(11) ( ) 
where the index # designates derivation by 3. The integral of (11) by # is 
fd where ¢ denotes the direction of the radius vector from the origin 0 to 
the variable point z(#) describing the circle of radius a around the center b, 


and hence we obtain 1 or 0 according as the origin lies within or without the 
circle. This distinction amounts to 


(12) |b | =| S an. 


We thus find 


the integral extending over that part of the (n —1)-dimensional phase torus 


where 
| < an, 


or 
(13) Wa = W(Gn3 41° Gn-1). 
The formula (10) for 3WxAx yields at once the equation 
(14) 
by taking A; =: --—A,—1. This result has its own interest independent 


of the problem of mean motion: 


Given n positive lengths ay, let Wi be the probability that an (n—1)- 
linkage with the sides * * Spans a distance < ay. Then the 
n probabilities W,,- +, Wn yield the sum 1, 


Wr. 
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Certain examples suggest that the equidistribution law (6) might not be 
applicable to functions with the type of singularity present in ¢((#)). HKW 
get around this difficulty by resorting to Birkhoff’s general ergodic theorem. 
But then the formula is proved only with the exception of an unknown zero 
set in the space of the initial phases ¢. Our evaluation of the integral in- 
dicates, however, that the evil-spelling singularities of the integrand (9) are 
more or less bluff. For a complete proof of our theorem I therefore fall back 
on a much earlier and more elementary procedure inaugurated in 1909 by 
P. Bohl [6] for the lowest non-trivial case n = 3. 

We slit the phase torus in the (n—1)-dimensional manifold of those 


points for which z= 2z(0,- - dn) is real and negative. On the slit torus 
the azimuth $(%,- --%,) is a continuous (periodic) single-valued function 


whose values lie between — 4% and + % (limits excluded). The boundary 
of the slit is the singular manifold. On the slit itself @ has a discontinuity, 
taking on the value + % on the one, the + side, and — 1% on the — side; 


it jumps by —1 or + 1 according as one crosses the slit in the positive or 
negative sense (i.e. from the -+ to the — side, or in the opposite direction). 
Therefore ¢(¢2) —(t,) differs by a term of absolute value < % from the 
number of times NV (t,t2) the straight line (¢) crosses the slit during the time 
interval ; St=t,. In computing N(t,t.) the sense of crossing has to be 
taken into account. Let us erect a cylindrical trunk 7’ over the slit as basis 
whose generator is the velocity vector A = (A,,°:*,An). Whenever our moving 
point #(¢) crosses the slit it will stay in 7 during the following unit of time, 
so that N(t,t.) is essentially the duration of its stay in 7 between ?¢, and 2. 
Hence N (t,t.)/(t2—#,) will tend to the volume of with 
In computing that volume the multiplicity and orientation in which 7’ covers 
the various parts of the torus are to be taken into account; the “ characteristic 
function ” of the trunk, to which the law of equidistribution (6) is here 
applied, is the covering index capable of the values 0, 1,+2,:-:. p is 
perhaps more fittingly described as the flux of the constant velocity field A 
through the slit, and is given by the integral over the slit of 


| 
(15) | 
| 


dn-191, 
++, denote n—1 tangential line elements at a point (#) of the slit 


In such orientation that (15) is positive or negative according as the current 
A crosses the slit at (#) in the positive or negative sense. We thus obtain 


8 


| 
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where W, is the area 


of the vertical projection of the slit upon the (n — 1)-dimensional (0,,-- -, On-1)- 
torus with the orientation of its covering taken into account. 

When we wish to ascertain whether (and with what index) a given point 
(9: - -Wn+1) is covered by the projection, we draw the (n— 1)-linkage 
* with the links (3) starting at A, =O. Denoting by 
we have the figure as described before: the circle of radius a, =a around } 
and the origin O. We now add the negative real axis [ issuing from O, and 
we have to watch whether the point z(#) crosses [ as # varies from 0 to 1. 
z(#) will cross once and in the positive sense if | b | < a; it will not cross at 
all, or once cross and then recross if |b | >a. Thus the covering index =1 
or 0 according to the distinction (12), and our previous formula (13) is 
confirmed. 

When looked at in this way the topological situation is as plain as it 
could be. The flux of a constant velocity field through an (n — 1)-dimen- 
sional surface bounded by an (nm — 2)-dimensional cycle does not depend on 
what two-sided surface one spans into the cycle; it may be described as the 
flux encompassed by the cycle and expressed by an integral extending over 
the cycle. If we had proceeded in this way we would have got entangled in a 
cobweb of topological difficulties. The singular manifold is of a complex topo- 
logical structure which, moreover, varies with the values of the parameters «;; 
the topologically different cases seem to be separated by those “ exceptional ” 
closed n-linkages whose a, fall apart into two groups of equal sum. The u- 
ravelling of this maze is avoided by using the slit rather than the singular 
manifold, and by defining the slit precisely as we did. 

By their very definition as probabilities the Wz are positive. But since 
we no longer make use of the Wintner integral (10) an independent proof of 
(14) becomes desirable. From the definition of z(#,- - -%,) it is obvious that 


$(0, +, 0, +9) arbitrary), 


and hence 
p= SWrax 


must increase by A when all frequencies Ax are raised by the same A, or Wi =!. 
In arguing thus, one is a little bit encumbered by the assumption that the A: 
are linearly independent. But by choosing #—=— d,, 


Dn) = (9; — Dn, — Dn, 0) + On, 
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one carries over the whole discussion from the n-dimensional to an (n —1)- 
dimensional phase space and finds the linear independence of A; —An,° °°, 
An-1 — An to be a sufficient condition for our result. But this amounts to the 
absence of any relation (5) with integral coefficients mz of sum 0, as stated 
in our theorem, and this hypothesis is not affected by passing from the fre- 
quencies Ax to Ay + A. 

So far we have assumed that the planet z(¢) does not pass through the 
origin. Should this happen at some moment ft, the continuation of $(¢) 
beyond t) may become (and will in general become) ambiguous, with ¢(t) 
splitting into two equally admissible branches which differ by 1. But our 
procedure shows that such an occurrence with its attendant accumulating 
ambiguity is rare enough to be of no influence upon the asymptotic law. 

Combining the result (13) with (4) one gets for n= 3 an explicit ex- 
pression in terms of Bessel functions: 


n-1 
(16) Wn 4 J, (Qnp) IT Jo( np) dp. 


I conclude with a few remarks on the history of our question. When 
Lagrange hit upon it in his approximate theory of secular perturbations, he at 
once solved the “ trivial ” case where one of the sides, az say, is larger than the 
sum of all others, 


One then has » = A» with the much sharper estimate 


(This estimate was implied in the term “mean motion,” as Lagrange used it. 
Actually all planets are in the Lagrangean case except Venus and Earth.) 
More than a century elapsed before P. Bohl succeeded in establishing mean 
motion for n = 3 in the non-Lagrangean case where 4, d2, a3 form a triangle 
4. He found 

+ + W:As, 


7W;, being the angles of A. This result is easily obtained if after the afore- 
mentioned elementary reduction of the dimensionality of the phase space to 
n—1, one expresses the flux in terms of the singular manifold which now is 
a 0-cycle. When the problem and Bohl’s paper were pointed out to me by 
Felix Bernstein in 1913, it started me on my investigations on Diophantine 
approximations (W,). Although the essential step was accomplished by the 
equidistribution law, the messy topology of the singular manifold at that time 
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prevented me from settling the problem by an explicit and universal formula. 
As an illustration I carried out the case n 4; the topology of closed 4- 
linkages has been thoroughly investigated in kinematics because such linkages 
are a frequent element in the construction of machines. Even there I limited 
myself to one of the topological subcases, see W,, Satz 7, and blundered in the 
final formulation (although the proof is correct): since my “ integral in- 
variants ” of the rickety quadrilateral have the sum 0 instead of 1, the term 
A, should be added in the final formula. But I will not mar the simplicity 
of our present argument by these old vagaries. Awareness of those complica- 
tions will, however, help the reader to a fuller appreciation of the result (8), 
(16) holding for all cases alike. 

This formula, which served me as a lodestar, was guessed for all n and 
verified for n = 3,4 by Professor Wintner. His report on the paper HKW 
in my Princeton seminar on current literature stimulated me to resume the 
ancient problem. The various equations known about Bessel integrals of the 
type (16) for lower n are most naturally derived from the interpretation put 
upon (16) by the problem of mean motion. 


THE INSTITUTE FOR ADVANCED STUDY, 
PRINCETON, N. J. 
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THE HOMOGENEOUS CHAOS. 


By Norspert WIENER. 


1. Introduction. Physical need for theory. 

2. Definition. Types of chaos. 

3. Classical ergodic theorem. Lebesgue form. 

4. Dominated ergodic theorem. Multidimensional ergodic theorem. 
5. Metric transitivity. Space and phase averages in a chaos. 
Pure one-dimensional chaos. 

Pure multidimensional chaos. 

8. Phase averages in a pure chaos. 

9. Forms of chaos derivable from a pure chaos. 

10. Chaos theory and spectra. 

11. The discrete chaos. 

12. The weak approximation theorem for the polynomial chaos. 
13. The physical problem. The transformations of a chaos. 


1. Introduction. Physical need for theory. Statistical mechanics 
may be defined as the application of the concepts of Lebesgue integration to 
mechanics. Historically, this is perhaps putting the cart before the horse. 
Statistical mechanics developed through the entire latter half of the nineteenth 
century before the Lebesgue integral was discovered. Nevertheless, it de- 
veloped without an adequate armory of concepts and mathematical technique, 
which is only now in the process of development at the hands of the modern 
school of students of integral theory. 

In the more primitive forms of statistical mechanics, the integration or 
summation was taken over the manifold particles of a single homogeneous 
dynamical system, as in the case of the perfect gas. In its more mature form, 
due to Gibbs, the integration is performed over a parameter of distribution, 
numerical or not, serving to label the constituent systems of a dynamical 
ensemble, evolving under identical laws of force, but differing in their initial 
conditions. Nevertheless, the study of the mode in which this parameter of 
distribution enters into the individual systems of the ensemble does not seem 
to have received much explicit study. The parameter of distribution is essen- 
tially a parameter of integration only. As such, questions of dimensionality 
are indifferent to it, and it may be replaced by a numerical variable « with the 
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range (0,1). Any transformation leaving invariant the probability properties 
of the ensemble as a whole is then represented by a measure-preserving trans- 
formation of the interval (0,1) into itself. 

Among the simplest and most important ensembles of physics are those 
which have a spatially homogeneous character. Among these are the homo- 
geneous gas, the homogeneous liquid, the homogeneous state of turbulence, 
In these, while the individual systems may not be invariant under a change 
of origin, or, in other words, under the translation of space by a vector, the 
ensemble as a whole is invariant, and the individual systems are merely per- 
muted without change of probability. From what we have said, the trans- 
lations of space thus generate an Abelian group of equi-measure transformations 
of the parameter of distribution. 

One-dimensional groups of equi-measure transformations have bevome 
well known to the mathematicians during the past decade, as they lie at the 
root of Birkhoff’s famous ergodic theorem.’ This theorem asserts that if we 
have given a set § of finite measure, an integrable function f(P) on S, and 
a one-parameter Abelian group 7 of equi-measure transformations of S into 


itself, such that 
(1) TH ow), 


then for all points P on S except those of a set of zero measure, and provided 


certain conditions of measurability are satisfied, 


(2) > P) ar 


A 


will exist. Under certain more stringent conditions, known as metric transi- 


tivity, we shall have 
A 
(3) lim f f(TMP)dr— f(P)dVp 
A»0A Jo 


almost everywhere. The ergodic theorem thus translates averages over al 
infinite range, taken with respect to A, into averages over the set S of finite 
measure. Even without metric transitivity, the ergodic theorem translates 
the distribution theory of A averages into the theory of S averages. 

In the most familiar applications of the ergodic theorem, 9 is taken to be 
a spatial set, and the parameter dA is identified with the time. The theorem 
thus becomes a way of translating time averages into space averages, in 4 


1 Cf. Eberhard Hopf, “ Ergodentheorie,” Ergebnisse der Mathematik und ihrer Grenz- 
gebiete, vol. 5. See particularly § 14, where further references to the literature are givel. 
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manner which was postulated by Gibbs without rigorous justification, and which 
forms the entire basis of his methods. Strictly speaking, the space averages 
are generally in phase-space rather than in the ordinary geometrical space of 
three dimensions. There is no reason, however, why the parameter \ should 
be confined to one taking on values of the time for its arguments, nor even 
why it should be a one-dimensional variable. We are thus driven to formulate 
and prove a multidimensional analogue of the classical Birkhoff theorem. 

In the ordinary Birkhoff theorem, the transformations 7 are taken to 
be one-one point transformations. Now, the ergodic theorem belongs funda- 
mentally to the abstract theory of the Lebesgue integral, and in this theory, 
individual points play no réle. In the study of chaos, individual values of 
the parameter of integration are equally unnatural as an object of study, and 
it becomes desirable to recast the ergodic theorem into a true Lebesgue form. 
This we do in paragraph 2. 

Of all the forms of chaos occurring in physics, there is only one class 
which has been studied with anything approaching completeness. This is the 
class of types of chaos connected with the theory of the Brownian motion. In 
this one-dimensional theory, there is a simple and powerful algorithm of phase 
averages, which the ergodic theorem readily converts into a theory of averages 
over the transformation group. This theory is easily generalized to spaces of 
a higher dimensionality, without any very fundamental alterations. We shall 
show that there is a certain sense in which these types of chaos are central in 
the theory, and allow us to approximate to all types. 

Physical theories of chaos, such as that of turbulence, or of the statistical 
theory of a gas or a liquid, may or may not be theories of equilibrium. In the 
general case, the statistical state of a chaotic system, subject to the laws of 
dynamics, will be a function of the time. The laws of dynamics produce a 
continuous transformation group, in which the chaos remains a chaos, but 
changes its character. This is at least the case in those systems which can 
continue to exist indefinitely in time without some catastrophe which essen- 
tially changes their dynamic character. The study, for example, of the de- 
velopment of a state of turbulence, depends on an existence theory which avoids 
the possibility of such a catastrophe. We shall close this paper by certain very 
general considerations concerning the demands of an existence theory of this 
sort. 


2. Definition. Types of Chaos. A continuous homogeneous chaos in 
n dimensions is a scalar or vector valued measurable function +, 
of in which assume all real values, while ranges 


over (0,1); and in which the set of values of a for which 
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(4) + 915° + Yyn3%) belongs to 8S, 


if it has a measure for any set of values of y,- - -, yn, has the same measure 
for any other set of values. In this paper, we shall confine our attention to 
scalar chaoses. A continuous homogeneous chaos is said to be metrically 
transitive, if whenever the sets of values of a for which - 
belongs to S and to S,, respectively, have measures M and M,, the set of values 


of a for which simultaneously 


*,%n;%) belongs to 
and 
p(%i + %1,°° belongs to S, 
has a measure which tends to M, as y,7>+---+ yn? ow. 


If p is integrable, it determines the additive set-function 


On the other hand, not every additive set-function may be so defined. This 
suggests a more general definition of a homogeneous chaos, in which the chaos 
is defined to be a function (3; «), where @ ranges over (0,1) and & belongs 
to some additively closed set = of measurable sets of points in n-space. We 


suppose that if 3} and %, do not overlap, 


(6) 215%) = + (2132). 

We now define the new point-set Yn) by the assertion 

(7) Yn) contains yn when and only when 


This leads to the definition of the additive set-function %y,,...,,(%3%) by 

If, then, for all classes S of real numbers, 

(9) Measure of set of a’s for which Sy, ...,1,(%3;%) belongs to class 5 


is independent of y,,° - -, yn, in the sense that if it exists for one set of these 
numbers, it exists for all sets, and has the same value, and if it is measurable 


-, Yn, We shall call % a homogeneous chaos. The notion of metrical 


in 
an3%) by 


transitivity is generalized in the obvious way, replacing p(%,° 
The theorem which we wish to prove is the following: 
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TuHeorEeM I. Let §(3;a) be a homogeneous chaos. Let the functional 


“1 
be a measurable function of a, such that f | g(a) logt | g(a)| | da is finite. 
0 


Then for almost all values of a, 


r>co V 


1 ‘ 
R 
exists, where R is the interior of the sphere 


and V(r) is its volume. If in addition, §(%;%) ts metrically transitive, 


for almost all values of «@. 


3. Classical ergodic theorem. Lebesgue form. ‘Theorem I is mani- 
festly a theorem of the ergodic type. Let it be noticed, however, that we 


nowhere assume that the transformation of « given by B = T'a when 
CY Cy 


is one-one. This should not be surprising, as the ergodic theorem is funda- 
mentally one concerning the Lebesgue integral, and in the theory of the 
Lebesgue integral, individual points play no rdle. 

Nevertheless, in the usual formulation of the ergodic theorem, the ex- 
pression f(7P) enters in an essential way. Can we give this a meaning 
without introducing the individual transform of an individual point? 

The clue to this lies in the definition of the Lebesgue integral itself. If 
{(P) is to be integrated over a region S, we divide S into the regions Sy4(f), 


defined by the condition that over such a region, 


(15) a<f(P) Sb. 

We now write 

(16) ( f(P)dVp =lim ne 
e S 


The condition that f(P) be integrable thus implies the condition that it be 
measurable, or that all the sets Sz,,(f) be measurable. 


Now, if 7 is a measure-preserving transformation on S, the sets 7’S.,1(f) 
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will all be measurable, and will have, respectively, the same measures as the 
sets San(f). We shall define the function f(7P) =g(P) by the conditions 


(17 T'Sa,v(f) = Sav(g). 


If T conserves relations of inclusion of sets, up to sets of zero measure, this 
function will clearly be defined up to a set of values of P of zero measure, 


f, f(P)dVp. 


We may thus formulate the original or discrete case of the Birkhoff 
ergodic theorem, as follows: Let S be a set of points of finite measure. Let T 
be a transformation of all measurable sub-sets of S into measurable sub-sets 
of S, which conserves measure, and the relation between two sets, that one 


and we shall have 


(18) f(TP)dVp 


contains the other except at most for a set of zero measure. Then except for 
a set of points P of zero measure, 


(19) 2 f(T"P) 


will exist. 

The continuous analogue of this theorem needs to be formulated in a 
somewhat more restricted form, owing to the need of providing for the in- 
tegrability of the functions concerned. It reads: Let S be a set of points of 
finite measure. Let T* be a group of transformations fulfilling the conditions 
we have laid down for T in the discrete case just mentioned. Let T*P be 
measurable in the product space of X and of P. Then, except for a set of 


points P of zero measure, 


(20) lim 
A 0 
will exist. 

In the proofs of Birkhoff’s ergodic theorem, as given by Khintchine and 
Hopf, no actual use is made of the fact that the transformation 7’ is one-one, 
and the proofs extend to our theorem as stated here, without any change. The 
restriction of measurability, or something to take its place, is really necessary 
for the correct formulation of Khintchine’s statement of the ergodic theorem, 
as Rademacher, von Neumann, and others have already pointed out.’ 

With the aid of the proper Lebesgue formulation of the ergodic theorem, 
the one-dimensional case of Theorem I follows at once. Actually more follows, 
as it is only necessary that g belong to L, instead of to the logarithmic class 


2Cf. J. v. Neumann, Annals of Mathematics, 2, vol. 33, p. 589, note 11. 
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with which we replace it. To prove Theorem I in its full generality, we must 
establish a multidimensional ergodic theorem. 


4. Dominated ergodic theorem. Multidimensional ergodic theorem. 
As a lemma to the multidimensional ergodic theorem, we first wish to establish 
the fact that if the function f(P) in the ergodic theorem satisfies the condition 


(21) J, | FP) | F(P)| < 


then the expressions (19) and (20) not only exist, but the limits in question 
will be approached under the domination of a summable function of P. We 
shall prove this in the discrete case, for the sake of simplicity, but the result 
goes over without difficulty to the continuous case. 

Let 7’ be an equimeasure transformation of the set S of finite measure 
into itself, in the generalized sense of the last paragraph, and let W be a 
measurable sub-set of S, with the characteristic function W(P). Let U be 
the set of all points P for which some 7’/P belongs to W. Let 1(P), when 
P belongs to W, be defined as the smallest positive number n such that 7-"P 
belongs to W, and let us call it the index of P. Every point of W, except for 
a set of measure 0, will have a finite index, since if we write Woo for the set 
of points without a finite index, no two sets T”We and T”Wo can overlap, 
while they all have the same measure and their sum has a finite measure. 
Thus except for a set of zero measure, we may divide W into the sets Wp, 
each consisting of the points of W of index p. It is easy to show that if 
<p’, the sets T-iW, and 
can not overlap over a set of positive measure unless j = 7’, p= p’. Similarly, 
the sets 7-?W, and 7-’W,, can not overlap over sets of positive measure, unless 
p= p’, and represent a dissection of W, except for a set of zero measure. Let 
us put Wyq for the logical product of Wp, and T?Wq; Wpgr for the logical 
product of W,, and and so on. Then if k is fixed, the sets T-4Wo,», . . . 
cover U (except for sets of zero measure) once as 7 goes from 0 to p,, once as 
it goes from p, to p; + ps, and so on; making just & times between 0, inclusive, 
and p, + po+---+ mm, exclusive. Thus if Sx is the set of all the points 
in all the Wy»... for which p, + po ++ pr=KXK, the total measure 
of all the Sx’s for 24+ = K = 2% can not exceed km(S) /2%. 

Now let P lie in WhereOSj << pit pot px Let 


us consider the sequence of numbers 4j, where if ppt 
<pt+:-:-+ pus, aj is the greatest of the numbers 
1 2 
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Then a; will be the largest of the numbers 


W(P) + W(TP) + W(TP) + 
W(P), + W(TIP) | 


The sum > aj, for a fixed K, will have its maximum gas when p; = p2="": 
= = = K + 1—k, when it will be + > k/j S=k(1 + log K/h), 
In this case the sequence of the a;’s will be 
k k k 
k times 
This remark will be an easy consequence of the following fact: let us 


consider the sequence 


and the modified sequence 


(22) 


(23) 
where of course 
> 
A+1 At] 


n—17] 


Apart from the arrangement, the terms of (22) will be the same as the terms 


A+ 1 
+ - replaces A/n. Now, 


of (23), except that in (23), 


so that the sum of the terms in (23) is greater than that in (22). 


i 
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Since the transforms of a given set have the same measure as the set, and 
the sets T’-/Sx cover U exactly k times, we have 


W(P) + max (wir), + 
k Js | 5 
Tk), +. max (W(P), W(P) + +0 


k 


K 
= 3 m(Sx) (1 + log — ) 
t us 4 
QN km (8) 
<= i y 
= m(W) + log )+3 (1 + log ) 
< (1+ log ~)} m(W) + const. 
the constant being absolute. If we now put 


km(S 
N= [ tog aw / log 2 | ; 


this last expression is dominated by 


(S) 
const. m(W) (: + log | 
Since the constant is independent of k, we see that the Cesaro average of 


is dominated by the same term. Thus 


m(S) 
= const. m(W) + log mr) 
and it follows by monotone convergence that there exists a function W*(P) 
such that 
m(S) 
(24) j, W*(P)dVp S const. m(W) ( 1 + log m(W) 


and for all positive m, 


| 
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Now let f(P) be a function such that 


(21) J, IPP) log" | < 
Let WY) be the. set of points such that 
(26) QV < | f(P)| 


and let f‘)(P) be the function equal to f(P) over this set of points, and 0 
elsewhere. Then there exists a function f'*(P), such that 


(27) (P) + =f'™)*(P) (m—0,1,2,---), 


and 
(N)*( Pp (N) (8) ON 
(28) (P)dVp const. m(W ) (1 + 10g )® 


Hence if 


00 m(S 

(29) (1 + log* < 
and 
(30) ft(P) 
then 

f(P) +: --+f(T*P) 
m+1 f*(P) (m = 0, 1, 2, ); 
and 


(32) f*(P)dVp S const. m(W™) (1 + log* On. 


However, 


(88) (1 + on 


< | f(P)| dVp + log* 2N+1m (9) 


J WIN) | f(P)| dVp 


| f(P) 1 + logs 21 m(8) dVp 
Je f(Q)dVoq 


2m(8) 
| f(P)| dVp [1 + logt 


+ | #(P)| log* | f(P)| dVp. 
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Thus f. f*(P)dVp has an upper bound which is less than a function of 


f. | f(P)| dVp and J, | f(P)| log* | f(P)| dVp, tending to 0 as they both 


tend to 0. This establishes our theorem of the existence of a uniform dominant. 
There is a sense in which (21) is a best possible condition. That is, if 


(34) ¥(2) o(log’ 
the condition 


is not sufficient for the existence of a uniform dominant. For let § be a set 
of measure 1, subdivided into mutually exclusive sets S,, of measures respec- 
tively Let S, be divided into mutually exclusive sets Sn,v,, all 
of equal measures. Let T transform into (& < v"), and Sn,v, into 
Sn. Let f(P) be defined by 


(36) f(P)=a&>0on Sn, (n=—1,2,:--); f(P) =0 elsewhere. 


Then the smallest possible uniform dominant of 


N 
(37) Sure) 
is 
(38) f*(P) on 
and we have 
Ss 
Thus if 
(40) Vn = 27" An, an = Q(n !), 


the function f*(P) will belong to Z if and only if 


(log an) 2-" 


Vn 


(41) 


> const. log* f(P) dV p. 


While we have proved the dominated ergodic theorem merely as a lemma 
for the multidimensional ergodic theorem, the theorem, and more particularly, 
the method by which we have proved it, have very considerable independent 
interest. We may use these methods to deduce von Neumann’s mean ergodic 
theorem from the Birkhoff theorem; or vice versa, we may deduce the Birkhoff 
theorem, at least in the case of a function satisfying (21), from the von 
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Neumann theorem. These facts however are not relevant to the frame of 
the present paper, and will be published elsewhere. 

We shall now proceed to the proof of the multidimensional ergodic 
theorem, which we shall establish in the two-dimensional case, although the 
method is independent of the number of dimensions. Let 77." be a two- 
dimensional Abelian group of transformations of the set S (of measure 1) 
into itself, in the sense in which we have used this term in paragraph 3. Let 
T,\T.“P be measurable in A, », and P. We now introduce a new variable 2, 
ranging over (0,1), and form the product space § of P and z. We introduce 
the one-parameter group of transformations of this space, 7’, by putting 


The expressions 7°(P,x) will be measurable in p and (P, x), and the trans- 
formations 7° will all preserve measure on ¥. Thus by the ergodic theorem, 
for almost all points (P,x) of 3%, if f(P) =f(P,7z) belongs to L, the limit 


A A 
(43) lim f(T°(P, x))dp =lim f sin 242 P) dp 
A-0o A 0 A 0 


will exist. If condition (26) is satisfied, it will follow by dominated con- 
vergence that the limit 


A 1 
44 lim 1 d cos sin dz 
A 
0 


will exist for almost all points (P,~), and hence for almost all points P. 

For the moment, let us assume that f(P) is non-negative. Then there 
is a Tauberian theorem, due to the author,*® which establishes that the expres- 
sion (44) is equivalent to 


1 A iad 
00 


The only point of importance which we must establish in order to justify this 
Tauberian theorem is that 


1 
1+iu 


for real values of u. Since every function f(P) satisfying (21) is the dif- 
ference of two non-negative functions satisfying this condition, (45) is estab- 


lished in the general case. 
It will be observed that we have established our multidimensional ergodic 


’N. Wiener, “ Tauberian theorems,” Annals of Mathematics, 2, vol. 33, p. 28. 
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theorem on the basis of assumption (21), and not on that of the weaker 
assumption that f(P) belongs to Z. What the actual state of affairs may be, 
we do not know. At any rate, all attempts to arrive at a direct analogue of 
the Khintchine proof for one dimension have broken down. The one-dimen- 
sional proof makes essential use of the fact that the difference of two intervals 
is always an interval, while the difference between two spheres is not always 
a sphere. 

The precise statement of the multidimensional ergodic theorem is the 
following: Let S be a set of points of measure 1, and let T,“T.- - - Ty be 
an Abelian group of equimeasure transformations of S into itself, in the sense 
of paragraph 3. Let T\4- - - TP be measurable in \4,---,An; P. Let R 
be the set of values of An for which 


and let V(r) be its volume. Let f(P) satisfy the condition (21). Then for 
almost all values of P, 


V 
eaists. 
That part of Theorem I which does not concern metric transitivity is an 
immediate corollary. 


5. Metric transitivity. Space and Phase Averages in a Chaos. If the 
function f(P) is positive, clearly 


(49) f f(T > TP) da, dXn 
= f(T; Ay TP) dr, dau: 


Hence 


and expression (48) has the same value for P and all its transforms under 
the group 7: - - 7". The condition of positivity is clearly superfluous. 
Thus in case expression (48) does not almost everywhere assume a single 
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value, there will be two classes S, and S2 of elements of S, each of positive 
measure, and each invariant under all the transformations 7“: - - Tn™. 

A condition which will manifestly exclude such a contingency is that if 
S, and S, are two sub-sets of S of positive measure, and e is a positive quantity, 


there always exists a transformation T =—T,™- - -T,*, such that 
| mS 
(51) | — ( —mS, | <e. 
mS 


From this it will immediately follow that if a chaos is metrically transitive in 
the sense of paragraph 3, the group of transformations of the a space generated 
by translations of the chaos will have the property we have just stated, and 
under the assumptions of Theorem I, 

700 


will exist and have the same value for almost all values of @. 
If almost everywhere 


1 


rox 


1 = 


where g(«) belongs to L, then by dominated convergence, 


| 
1 
(254) 


That is, the average of ®{%(%;«)}, taken over the finite phase space of 2, is 
almost everywhere the same as the average of ®{%y,,..., y,(%3%)} taken over 
the infinite group space of points y;,° - *,Yn. This completes the establish- 
ment of Theorem I, and gives us a real basis for the study of the homogeneous 
chaos.* 

6. Pure one-dimensional chaos. The simplest type of pure chaos is 
that which has already been treated by the author in connection with the 
Brownian motion. However, as we wish to generalize this theory to a multi- 


‘The material of this chapter, in the one-dimensional case, has been discussed by 
the author with Professor Eberhard Hopf several years ago, and he wishes to thank 
Professor Hopf for suggestions which have contributed to his present point of view. 


po 
TI 
see 


ob 


sp 
su 
wl 
int 
bin 
dis 


vol 


p 
W 
d 
a 
( 
al 
m 
tl 
ce 
if 

and 
by 

ch 
dS 
ne 
or 
wl 


THE HOMOGENEOUS CHAOS. 911 


plicity of dimensions, instead of referring to existing articles on the subject, 
we shall present it in a form which emphasizes its essential independence of 
dimensionality. 

The type of chaos which we shall consider is that in which the expression 
(3a) has a distribution in a dependent only on the measure of the set 3; 
and in which, if %, and 3, do not overlap, the distributions of %(%,,«) and 
y(X2,%) are independent, in the sense that if ¢(2,y) is a measurable func- 
tion, and either side of the equation has a sense, 


(56) ff )dadp— f a), a) 


We assume a similar independence when n non-overlapping sets 31,° °°, =n 
are concerned. It is by no means intuitively certain that such a type of chaos 
exists. In establishing its existence, we encounter a difficulty belonging to 
many branches of the theory of the Lebesgue integral. The fundamental 
theorem of Lebesgue assures us of the possibility of adding the measures of a 
denumerable assemblage of measurable sets, to get the measure of their sum, 
if they do not overlap. Accordingly, behind any effective realization of the 
theory of Lebesgue integration, there is always a certain denumerable family 
of sets in the background, such that all measurable sets may be approximated 
by denumerable combinations of these. This family is not unique, but without 
the possibility of finding it, there is no Lebesgue theory. 

On the other hand, a theory of measure suitable for the description of a 
chaos must yield the measure of any assemblage of functions arising from a 
given measurable assemblage by a translational change of origin. This set of 
assemblages is essentially non-denumerable. Any attempt to introduce the 
notion of measure in a way which is invariant under translational changes of 
origin, without the introduction of some more restricted set of measurable sets, 
which does not possess this invariance, will fail to establish those essential 
postulates of the Lebesgue integral which deal with denumerable sets of points. 
There is no way of avoiding the introduction of constructional devices which 
seem to restrict the invariance of the theory, although once the theory is 
obtained it may be established in its full invariance. 

Accordingly, we shall start our theory of randomness with a division of 
space, whether of one dimension or of more, into a denumerable assemblage of 
sub-sets. In one dimension, this division may be that into those intervals 
whose codrdinates are terminating binary numbers, and in more dimensions, 
into those parallelepipeds with edges parallel to the axes and with terminating 
binary codrdinates for the corner points. We then wish to find a self-consistent 
distribution-function for the mass in such a region, dependent only on the 


volume, and independent for non-overlapping regions. 
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This problem does not admit of a unique solution, although the solution 
becomes essentially unique if we adjoin suitable auxiliary conditions. Among 
these conditions, for example, is the hypothesis that the distribution is sym- 
metric, as between positive and negative values, has a finite mean square, and 
that the measure of the set of a’s for which %(S;@) >A is a continuous 
function of A.° Without going into such considerations, we shall assume 
directly that the measure of the set of instances in which the value of 3(S; 2) 
in a region S of measure M lies between a and b >a, is 


1 b 
57 exp 7) du. 
(57) View Jo \ 2M 


The formula 


(58) exp ( ) du 


1 v? 
shows the consistency of this assumption. 

The distributions of mass among the sets of our denumerable assemblage 
may be mapped on the line segment 0 =a=1, in such a way that the 
measure of the set of instances in which a certain contingency holds will go 
into a set of values of « of the same measure. This statement needs a certain 
amount of elucidation. To begin with, the only sets of instances whose 


measures we know are those determined by 


ay = a) b, 


= (823%) S be 
(59) 
An <= a) = ba; 
where S,,S2,: --,S, are to be found among our denumerable set of sub- 


divisions of space. However, once we have established a correspondence between 
the measures of these specific sets of contingencies and their corresponding 
sets of values of «, we may use the measure of any measurable set of values 
of a to define the measure of its corresponding set of contingencies. 

The correspondence between sets of contingencies and points on the line 
(0,1) is made by determining a hierarchy of sets of contingencies 


a, a) <= b, (mn) 


(60) 
(by (mn) a) <= by lm), 


5 Cf. the recent investigations of Cramér and P. Lévy. 
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Let us call such a contingency C'm,n». If m is fixed, let all the contingencies 


(1 = 1, be mutually exclusive, and let them be finite in number. 
Let us be able to write 

N 

k=1 


If 8’ is one of our denumerable sets of regions of space, let every Cm,n with a 
sufficiently large index m be included in a class determined by a set of con- 
ditions concerning the mass on S’ alone, and restricting it to a set of values 
lying in an interval (c,d), corresponding to an integral of form (57) and of 
arbitrarily small value. (Here d may be «, or c may be — o.) Let us put 
C;,, for the entire class of all possible contingencies, and let us represent it 
by the entire interval (0,1). Let us assume that C'm,, has been mapped into 
an interval of length corresponding to its probability, in accordance with (57), 
and let this interval be divided in order of the sequence of their n,’s into 
intervals corresponding respectively to the component Cns1,n,’8, and of the 
same measure. [Except for a set of points of zero measure, every point of the 
segment (0,1) of « will then be determined uniquely by the sequence of the 
intervals containing it and corresponding to the contingencies Cm, for suc- 
cessive values of m. This sequence will then determine uniquely (except in a 
set of cases corresponding to a set of values of « of zero measure) the value 
of §(Sn;«) for every one of our original denumerable set of sets Sn. 

So far, everything that we have said has been independent of dimen- 
sionality. We now proceed to something belonging specifically to the one- 
dimensional case. If the original sets 8, are the sets of intervals with binary 
end-points, of such a form that they may be written in the binary scale 


where d,,° - -, d; are digits which are either 0 or 1, then any interval whatever 
of length not exceeding 2“ and lying in (a,6) (where a and 6 are integers) 
may be written as the sum of not more than two of the (b — a) 2? intervals 
of form (62) lying in (a,b) and of length 2", not more than two of the 
(6 —a)2#*? intervals of length 2°, and so on. The probability that the 
value of | §(Sn;@)| should exceed A, or in other words, the measure of the 


set of «’s for which it exceeds A, is 
(63) x ( ) au o{exp(— Am(S,,) +) } 
V JA 2m (Sn) 
Now let us consider the total probability that the value of | 3(S,: a) | 
should exceed 2-(¥*) 4) for any one of the (b —«)2#*! intervals of length 
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2-41, or 2-(+2) for any one of the (b —a)2** intervals of length 
or soon. This probability can not exceed 


(64) (b — a) o(exp(— €)) exp(— 
k=1 


On the other hand, the sum of | §(S,;)| for all the 2+ 2-4: - - intervals 
must in any other case be equal to or less than 


k=1 


Thus there is a certain sense in which over a finite interval, and except for a 
set of values of a of arbitrarily small positive measure, the total mass in a 
sub-interval of length = 2 tends uniformly to 0 with 2-“. On this basis, 
we may extend the functional %(S;) to all intervals S. It is already defined 
for all intervals with terminating binary end-points. If (c,d) is any interval 


whatever, let ¢,,¢2,: - - be a sequence of terminating binary numbers ap- 
proaching c, and let d,,d2,- - - be a similar sequence approaching d. Then 


except for a fixed set of values of « of arbitrarily small measure, 


(66) lim | (Cn, dn) + — 3( (em, dm ) x) | | 


M,N 
and we may put 


(67) (c,d); =lim (cm, dm); 


OO 


Formula (57), and the fact that 3(S,;«) and %(S2; a) vary independently 
for non-overlapping intervals S,; and S2, will be left untouched by this extension. 

Thus if = is an interval and 7” a translation through an amount A, 
we can define (73; a), and it will be equally continuous in A over any finite 
range of A except for a set of values of « of arbitrarily small measure. From 
this it follows at once that it is measurable in A and « together. Furthermore, 
we shall have 

Measure of set of @’s for which %(7°3; a) belongs to C = 


vr) Measure of set of as for which %(%;a) belongs to C. 


Thus %(%;@) is a homogeneous chaos. We shall call it the pure chaos. 

If &{%(%;a)} is a functional dependent on the values of (3%; ) for a 
finite number of intervals =,, then if A is so great that none of the intervals 
overlaps any translated interval ®{% (3; «%)} will have a distribution 
entirely independent of ©{%§}(7*S;«)}. As every measurable functional may 
be approached in the LZ sense by such a functional, we see at once that §(: 2) 
is metrically transitive.® 


° Except that the method of treatment has been adapted to the needs of §7, the 
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¢7. Pure multidimensional chaos. In order to avoid notational com- 
plexity, we shall not treat the general multidimensional case explicitly, but 
shall treat the two-dimensional case by a method which will go over directly 
to the most general multidimensional case. If our initial sets Sn are the 
rectangles with terminating binary codrdinates for their corners and sides 
parallel to the axes, and we replace (a,b) by the square with opposite vertices 
(p,q) and (p+r, q+ 7), an argument of exactly the same sort as that 
which we have used in the last paragraph will show that except in a set of 
cases of total probability not exceeding 


OO x 
(69) const. o (exp(— ) — 0 (2 exp(— ) 
k=1 


the sum of | %(S,;)| for a denumerable set of binary rectangles with base 
< 2, of the form (62), and adding up to make a vertical interval lying in 
the square (p,q), (p+7,q¢ +7), must be equal to or less than 


(65) (Z-€) — O ), 
k=1 


If we now add this expression up for all the base intervals of type (62) neces- 
sary to exhaust a horizontal interval of magnitude not exceeding 2, we shall 
again obtain an expression of the form (65). It hence follows that if we take 
the total mass on the codrdinate rectangles within a given square, this will 
tend to zero uniformly with their area, except for a set of values of @ of 
arbitrarily small measure. From this point the two-dimensional argument, 
and indeed the general multidimensional argument, follows exactly the same 
lines as the one-dimensional argument. It is only necessary to note that if 


An 4, bn > b, Cn 6, dnd 


then the rectangles (dm, 0m), (Cm; dm) and (dn, bn), (¢n.dn) differ at most by 
four rectangles of small area.’ 
From this point on, we shall write P(S; a) for a pure chaos, whether in 


one or in more dimensions. 


8. Phase averages in a pure chaos. If f(/) is a measurable step- 
function, the definition of 


(70) (8; 2) 


results of this section have previously been demonstrated by the author. (Proceedings 
of the London Mathematical Society, 2, vol. 22 (1924), pp. 454-467). 
7 Here we represent a rectangle by giving two opposite corners. 
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is obvious, for it reduces to the finite sum 

N 

(71) > fa? @) 

1 


where fn are the N values assumed by f(P), and S, respectively are the sets 
over which these values are assumed. Let us notice that 


1 a N WN 1 a 
(72) dal f(P)dp® (830)? (Sn; 2) (Su 5 2) 


m=1 n=le 


N 1 
(P (Sn; a) )2da 


| fn |? ex ( ) du 
2am (Sn) %-co 2m (Sn) 


N 
1 
V 2a J -00 
N 


the integral being taken over the whole of space. In other words, the trans- 


formation from f(P) as a function of P, to fryer (S;a) as a function 
of a, retains distance in Hilbert space.* Such a transformation, by virtue of 
the Riesz-Fischer theorem, may always be extended by making limits in the 
mean correspond to limits in the mean. Thus both in the one-dimensional 
and in the many-dimensional case, we may define 


(73) f(P)dp? (S34) —1.i.m. (S52) 


where is a function belonging to and the sequence f;(P), fe(P),° 
is a sequence of step-functions converging in the mean to f(P) over the whole 
of space. The definition will be unambiguous, except for a set of values of « 
of zero measure. 

If S is any measurable set, we have 


(74) J, 219 (8:2) = u" exp (— du 


1 
= (m(8))"/” du 
V 2a J-x 


=( if n is odd 
= (m(S) )"/?(n —1)(n—3)-- -1 if n is even. 


°Cf. Paley, Wiener, and Zygmund, Mathematische Zeitschrift, vol. 37 (1933), 
pp. 647-668. 
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This represents (m(S))"/?, multiplied by the number of distinct ways of 
representing » objects as a set of pairs. Remembering that if S,,S.,°- -, Sen 
are non-overlapping, their distributions are independent, we see that if the 
sets 31, X2,° °°, Son are either totally non-overlapping, or else such that when 


two overlap, they coincide, we have ° 
1 
(75) J (3,34): P(3,5a)da = P(X; a) da, 


where the product sign indicates that the 2n terms are divided into n sets of 
pairs, 7 and &, and that these factors are multiplied together, while the addi- 
tion is over all the partitions of 1,- - -,2n into pairs. If 2n is replaced by 
2n + 1, the integral in (75) of course vanishes. 

Since #(S;a) is a linear functional of sets of points, and since both 
sides of (75) are linear with respect to each P (3%; «) separately, (75) still 
holds when 3;, 32,° - -, en can be reduced to sums of sets which either coin- 
cide or do not overlap, and hence holds for all measurable sets. 

Now let f(P;,: - -, Pn) be a measurable step-function: that is, a function 
taking only a finite set of finite values, each over a set of values P,,-- -, Px 
which is a product-set of measurable sets in each variable Px. Clearly we 


may define 


in a way quite analogous to that in which we have defined (70), and we shall 


have 


1 
(77) f da f Pu)dp,P (S34) ++ «dp, P(S; a) 
0 


where the summation is carried out for all possible divisions of the 2n P’s 
into pairs. Similarly in the odd case 


We may apply (77) to give a meaning to 
1 


If f(P;,- - Pn) is a measurable step-function, and 


® Cf. Paley, Wiener, and Zygmund, loc. cit., formula (2.05). 
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(80) | f(P1,° = | f.(P:) - | 3 
| fx(P)|?dVpS A (ax 1,2,- + -) 


we shall have 


S A"(2n 


If now f(P:,---,Pn) is an integrable function satisfying (80), but not 
necessarily a step-function, let 


Clearly almost everywhere 


(83) f(v3Pi,° -,Pn)dp,P (S30) -dp,P(S; a) 
= I f fx(P)dpP (S; a) 


and 

(84) Tim | ff f(us (83.4) P (S34) 

fel P) de® (8; a) 

(8; 2) 


where & represents the exterior of a sphere of arbitrarily large volume. Let 
it be noted that both the numerator and the denominator of this fraction have 
Gaussian distributions, but that the mean square value of the numerator is 
arbitrarily small. Thus except for a set of values of « of arbitrarily small 
measure, the right side of expression (84) is arbitrarily small, so that we may 
write 


(85) lim {f- +, Pn)dp,P (S30) -dp,P(S; a) 


®, 


ffs +, Pn)dp,P -dp,P (8; a) = 


Thus by dominated convergence, 
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exists for almost all values of «, and we may write it by definition 


This will clearly be unique, except for a set of values of a of zero measure. 
There will then be no difficulty in checking (77), (78), and (81). 


9. Forms of chaos derivable from a pure chaos. Let us assume that 
f(P) belongs to L’, or that f(P:,: + -, Pn) is a measurable function satisfying 


(80). Let us write PQ for the vector in n-space connecting the points P and 
Q. Then the function 


(88) f f(PP,,: PP,)dp,P (S; a) dp, P (S34) = F(P; a) 


is a metrically transitive differentiable chaos. This results from the fact that 
?#(S;a) is a metrically transitive chaos, and that a translation of P gen- 
erates a similar translation of all the points P;. The sum of a finite number 
of functions of the type (88) is also a metrically transitive differentiable chaos. 
To show that /(P;a) is measurable in P and @ simultaneously, we merely 
repeat the argument of (83)—(86) with both P and «@ as variables. 

We shall call a chaos such as (88) a polynomial chaos homogeneously of 
the n-th degree, and a sum of such chaoses a polynomial chaos of the degree 
of its highest term. In this connection, we shall treat a constant as a chaos 
homogeneously of degree zero. 

By the multidimensional ergodic theorem, if ® is a functional such that 


(89) | | log" | #(F(P32))| da < 
we shall have 


for almost all values of « Since the distribution of /’(P;«) is dominated by 
the product of a finite number of independent Gaussian distributions, we 


even have 
1 

(91) f | F(P;a)|"da< 
0 


for all positive integral values of n. In a wide class of cases this enables us 


to establish a relation of the type of (89). 
In formula (90), we have an algorithm for the computation of the right- 


hand side. For example, if 
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(92) F(P32) f(PP,)de,® (832), 

and P + Q is the vector sum of P and Q, we have for almost all a, 

(93) lim f + Q)F(Q)aVe, 


the integral being taken over the whole of space; if 


(94) -F(P; 2) 3 (PP, PP.) dp,P (S; a) dp,P? (S; a), 


we. have almost always 


(95) lim ay P(P + 


f f(P+Q,P + Q)dVodVu; 


and if 
(96) P(P3a)— f(PP.,PP., PP.) dp, (934) (832) (532), 


we have almost everywhere 


—S Sf 


+ §(Q,Q,P + M)j(S, M, 8) + £(Q,Q,P + U)](S, 8, 
+ (Q,P + M,Q)f(M,S, 8) +f(Q,P + M, Q)f(S, M, 8) 
+f(Q,P + M, Q)f(S,8,M) + f(P+M, Q, Q)f(M, 8, 8) 
+ f(P+M, Q, Q)f(8,M, 8) +f(P + M, Q, Q)f(8, 8, M) 
M,8) 
+f(P+Q,P+M,P+8)f(M, 8, Q) 
M, Q)}dVedVudVs. 


We have similar results in the non-homogeneous case. Thus if 


(97) lim — 


(98) F(P;a) =A +f j(PP,) dp,? (8; a) 
+ff PP») dp,? (S; 2) dp,P (8; 2), 


| 

1 

| 
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we have almost everywhere 


(99) lim F(P +34) 


=4 9(0,Q)aVo+4 Q)aVo 
+ 


10. Chaos theory and spectra.’° The function 


(100) tien ay J, + = 


occupies a central position in the theory of harmonic analysis. If it exists 
and is continuous for every value of P, the function F(P) is said to have an 
n-dimensional spectrum. To define this spectrum, we put 


F(P) on Rk; 
(101) 0 elsewhere. 


It is then easy to show by an argument involving considerations like those of 
(49) that if 


(102) G(P) = + 
V(r) Joo 
the integral being taken over the whole of space, then we have 
(103) G(P) =limG,(P). 


Since, if O is the point with zero codrdinates, by the Schwarz inequality, 
(104) | G(P)| G(O), 


the limit in (103) is approached boundedly. 
If now we put 


(105) = V(r) AL. i. m. f F,(P) e-PdV p 
8 


where § is the interior of a sphere of radius s about the origin, the n-fold 


Parseval theorem will give us 


10 Cf, N. Wiener, “Generalized harmonic analysis,” Acta Mathematica, vol. 55 
(1930). 


= 
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(106) | |? = G,(P) p. 


If M(U) is a function with an absolutely integrable Fourier transform, we 
shall have 


co ye 

-CO co co 
and hence 

. 

(108) lim | |? M(U)dVu = (2m) f O(P)aVe , 

e 
which will always exist. Let us put 
(109) 94{M(U)}= f G(P)dVe 

co 


If § is any set of points of finite measure, and S(P) is its characteristic 
function, let us put 


(110) H(S) = lub. H(M(U)), 
M(U)=S(U) 
and 
(111) H(S) = g.l.b. M(M(U)). 
M(U)=S(U) 


If 9(S) and 9{(S) have the same value, we shall write it 9/(S), and shall 
call it the spectral mass of F on 8. It will be a non-negative additive set- 
function of S, and may be regarded as determining the spectrum of F. 

If f(Pi,: - +, Pn) satisfies (80) and F(P;«) is defined as in (88), we 
know that for any given P, 


for almost all values of « This alone is not enough to assure that F'(P, «) 
has a spectrum for almost all values of «, as the sum of a non-denumerable set 
of sets of zero measure is not necessarily of zero measure. On the other hand, 
except for a set of values of « of zero measure, G(P, «) exists for all points P 
with rational codrdinates. 

We may even extend this result, and assert that if 


(113) F,(P;2) - F(S;a)aVs 
of 
and 
J (1) 


(11 


i 
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then except for a set of values of a of zero measure, Gg(P; a) exists for all 
points P with rational codrdinates and all rational parameters 6, and it is 


easily proved that for almost all values of a, as 6 tends to 0 through rational 
values, 


(115) lim lim - | Fo(Q; %) —F(Q; a) |? dVo=0. 
V = r) 


Now, by the Schwarz inequality, 


(P, + Q32)F,(Q; 2)dVo | 


l 


= G,(0; PP, 


It thus follows that if (114) exists for a given @ and all P’s with rational 
coordinates, it exists for that @ and all real P’s whatever. We may readily 
show that 

(117) Gg(O;a%) = G(O;¢). 


By another use of the Schwarz inequality, 
F(P + Q;2)F(Q;a)dV¢ 
Se | Po(P + Q; 2) —F(P+Q;2)| | «)| 
| F(P+Q;)| | —F(Q;@)| dVo 
V (1) R 
< {G(0:2) aS, | + —F(P+ Q;«)|? dVo}! 
V(r) 
| — F(Q; 2) |? dVo}?. 
V(r) Jr 


Combining (115) and (118), we see that except for a set of values of « of zero 
measure, we have for all P, 


(119) G(P;«) =lim Gg(P; 2). 
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We thus have an adequate basis for spectrum theory. This will extend, not 
merely to functions F(P,«) defined as in (88), but to finite sums of such 
functions. It will even extend to the case of any differentiable chaos F(P;«), 
for which F(P + Q;«)F(Q;«) is an integrable function of a, and for which 
(115) holds. For a metrically transitive chaos, this latter will be true if 
(120) lim | Fy(P; 2) —P(P; 4) |? da—0. 

Under this assumption, we have proved that F(P) has a spectrum, and the 
same spectrum, for all values of a. 

This enables us to answer a question which has been put several times, 
as to whether there is any relation between the spectrum of a chaos and the 
distribution of its values. There is no unique relation of the sort. The 
function 


(121) f 


where g belongs to L*, may be so chosen as to represent any Fourier transform 
of a positive function of L, and if f(P,Q,M) is a bounded step-function, the 
right-hand side of (97) will clearly be the Fourier transform of a positive 
function of LZ. In particular, let f(P:, P2,P3) =f(P:)f(P2)f (Ps), 


(122) F,(P3a) f.(PP,)dp,® (832) 
and choose f,(@) in such a way that 


(123) f fi(P+ Q)f:(Q)dVo = right-hand side of (97). 
Then 


(124) (F,(P; | — 1) (2n — 8) ---1 
and if F(P,«) is defined as in (96), 

so that for all but at most one value of n, 

(126) (F,(P; a) (F(P; «))*da 


and we obtain in F' and F, two chaoses with identical spectra but different 
distribution functions. On the other hand, if 


( 
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the chaoses 


(128) f(PP.)de,? (8; 2) 


and 
(129) Jf fe( PP.) (85 
will have the same distribution functions, but may have very different spectra. 


11. The discrete chaos.1_ Let us now divide the whole of Euclidean 
n-space dichotomously into sets Sm,n, such that every two sets Sim,n, and Sms,nz 
have the same measure, and that each Sm,n is made up of exactly two non- 
overlapping sets Simi. Let us divide all these sets into two categories, 
“ occupied,” and “empty.” Let us require that the probability that a set be 
empty depend only on its measure, and that the probability that two non- 
overlapping sets be empty be the product of the probabilities that each be 
empty. Let us assume that both empty and occupied sets exist. Let every 
set contained in an empty set be empty, while if a set be occupied, let at least 
one-half always be occupied. We thus get an infinite class of schedules of 
emptiness and occupiedness, and methods analogous to those of paragraph 6 
may be used to map the class of these schedules in an almost everywhere one- 
one way on the line (0,1) of the variable «, in such a way that the set of 
schedules for which a given finite number of regions are empty or occupied 
will have a probability equal to the measure of the corresponding set of 
values of a. 

By the independence assumption, the probability that a given set Sm,n be 
empty must be of the form e-4™‘Sm)_ Tf Sinn is divided into the 2” intervals 
Sinavny’ °° > Smsv,ngv at the v-th stage of sub-division, the probability that just 
one is occupied and the rest are empty is 


(130) 2’(1—exp(— Am(Sm,n) /2”) exp (— Am(Sm,n 
This contingency at the vy + 1-st stage is a sub-case of this contingency at the 
v-th stage. If we interpret probability to mean the same thing as the measure 
of the corresponding set of as, then by monotone convergence, the probability 
that at every stage, all but one of the subdivisions of S»,n are empty, while the 
remaining one is occupied, will be the limit of (130), or 


(131) Am(Sm,n)exp(— Am(Smn)). 
11The ideas of this paragraph are related to discussions the author has had with 
Profes or von Neumann, and the main theorem is equivalent to one enunciated by the 


latter. 
10 
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Such a series of stages of subdivision will have as its occupied regions exactly 
those which contain a given point. 

The probability that the occupied regions are exactly those which contain 
two points is the probability that each half of Sn,» contain exactly one point, 
plus the probability that one-half is empty, and that in the occupied half, each 
quarter will contain exactly one point, plus and so on. This will be 


(132) ) Am(Smn) exp ( 


2 
Am(Smn)\ Am(Sm,n) Am(S8mn) \* 
+ 2 exp ( ) 5 exp 4 
+: +4+° °°) 


(Am (Sim,n) had 


9 exp (— Am(Sm,n) ). 


If the probability that the occupied regions are exactly those containing k — 1 
points is 
(Am 


(k—1)! exp (— Am(Smn,n) ) 


then a similar argument will show that the probability that the occupied 
regions are exactly those containing & points will be 


1 1 1 
(133) (Am(Sm,n) (1 + + +) 


— 2) (Am(Smn))* exp (— Am(Sm.n)) 


(A m (Sin,n) )' exp Am (Sin,n) ). 


Thus by mathematical induction, the probability that the occupied regions 


are exactly those containing / points will be 
k ax g 
(Am(Sm,n) )* exp (— Am(Sm,n) ) 
and the sum of this for all values of & will be 
1 
(134) >» (Am(Sm,n))* exp (— Am(Smn)) = 1. 
0 
In other words, except for a set of contingencies of probability zero, the 


occupied regions will be exactly those containing a given finite number of 


points. 
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We may proceed at once from the fact that the probability that a set S; 


contains exactly & points is 


(Am(S;) )* e-Am(S1) 


while the probability that the non-overlapping set S. contains exactly i 
points is 


a (Am (S2) 


to the fact that the probability that the set S,-+ S. contains exactly & 
points is 


k 1 


1 

kp-Am(S,+82) 

From this, by monotone convergence, it follows at once that the probability 

that any set S which is the sum of a denumerable set of our fundamental 

regions Sm,n should contain exactly k points is 


(Am(S) )e-Am(S), 


It is then easy to prove this for all measurable sets 9. 

We are now in a position to prove that the additive functional D(S;<), 
consisting in the number of points in the region S on the basis of the schedule 
corresponding to a, is a homogeneous metrically transitive chaos. The rdéle 
which continuity filled in paragraphs 6 and 7, of allowing us to show that 
was measurable in y;,° +, Yn and @, is now filled by the fact 
that the probability that any of the points in a region lie within la very small 
distance of the boundary, is for any Jordan region the probability that a small 
region be occupied, and is small. The metric transitivity of the chaos results 
as before from the independence of the distribution in non-overlapping regions. 

The discrete or Poisson chaos which we have thus defined is the chaos of 
an infinite random shot pattern, or the chaos of the gas molecules in a perfect 
gas in statistical equilibrium according to the old Maxwell statistical me- 
chanics. It also has important applications to the study of polycrystalline 
aggregates, and to similar physical problems. 

Two important formulae are 


1 fo k 
(136) D a) da = e-Am(S) (Am(S))* = Am(S), 
0 1 


1 
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and 
1 CO 7.2 

(137) f (D (S; a) == e-Am(S) (Am(S))* = (Am(S))? + Am(S). 
0 


Let it be noted that if we define 
(138) f(P)deD (83) 
for a measurable step-function f(P) as in (70), by 
(139) S fuD 
(72) is replaced by 
(140) F(Q)aVo |? 
fnfnD %)D (Sn; a) 


m=1 n=1 


“da fn D (Sn 3-2) f #(Q)dVo 
N ad 

— 2 | Im |? Am(Sm) 

| {(P) |? 


Thus the transformation from f(/’) as a function of P, to 


as a function of a, retains distance in Hilbert space, apart from a constant 
factor, and if f(P) belongs to LZ and ZL? simultaneously, and {fn(P)} is a 
sequence of step-functions converging in the mean both in the L sense and in 
the L? sense to f(P), we may define 


(142) f f(P)drD (85.2) #(Q)dVo 
+ Lim. fa(P)dpD —A fn(Q) 


As in the case of (73), this definition is substantially unique. We may prove 
the analogue of (93) in exactly the same way as (93) itself, and shall obtain 


(143) lim f, (f(P+ a)dyD (8; Af f(M)dV | 
79 
F(QM; «)dyD (8; a) 7(M)dVu ( 


— f (P+ 


a 

V 
d 
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As we may see by appealing to the theory of spectra, one interpretation of this 
in the one-dimensional case is the following: /f a linear resonator be set into 
motion by a haphazard series of impulses forming a Poisson chaos, the effect, 
apart from that of a constant uniform stream of impulses, will have the same 
power spectrum as the energy spectrum of the response of the resonator to a 
single impulse. 


12. The weak approximation theorem for the polynomial chaos. We 
wish to show that the chaoses of paragraph 9 are in some sense everywhere 
dense in the class of all metrically transitive homogeneous chaoses. We shall 
show that if %(S;) is any homogeneous chaos in n dimensions, there is a 
sequence %%,(S;«) of polynomial chaoses as defined in paragraph 9, such that 
if S,,° - +, Sy is any finite assemblage of bounded measurable sets in n-space 
selected from among a denumerable set, and 


“1 

(144) J 1 x (A= 
0 

is finite, then 


1 1 
e 0 0 


We first make use of the fact that if the probability that a quantity wu be 


greater in absolute value than A, be less than 


2 
(146) e7 2B) du, 
V A 
then if y(w) is any even measurable function bounded over (— 2%, 0), we 
may find a polynomial y(u), such that the mean value of 


(147) | — (u) |", 

which will be 

(148) — We(u) |" du 


is less than e. Since it is well known that if ¢(w) is a continuous function 


vanishing outside a finite interval, and 


0° 


(149) AnAn 


1 
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is the series for @(w) in Hermite functions, then we have uniformly 


(150) o(u) = lim 


t-1-0 1 
to establish the existence of y.(u), we need only prove it in the case in which 


(151) y(u) = uke Cw 


for an arbitrarily small value of C’: as for example for C = 1/4n,B. We shall 
then have 


| N(cu?)* = (on’}* 
so that by dominated convergence, and if we take N large enough, we may 
make 
| 
(153) —Xe(u) |\"e du < (n= n,). 
V -0O 


Now let 


(154) (u) = 


and let us put 
(155) §(P;«) =——~ (S = interior of |QP| Sr). 


V(r) 
The chaos 
(156) 


may then be approximated by polynomial chaoses in such a way as to approxi- 
mate simultaneously to all polynomials in @(P;a) by corresponding poly- 
nomials in the approximating chaoses. Since the distribution of the values 
of &(P;«) will be Gaussian, with a root mean square value proportional to a 
power of r, and §(P;«) will be independent in spheres of radius 7 about two 
points P, and P, more remote from each other than 2r + 2y, it follows that 
if we take K to be large enough, we may make the probability that #(P:; «) 
differs from 0 between two spheres of radii respectively r+ 7 and H about a 
given point where it differs from 0, as small as we wish. 
We now form the new chaos 
(157) —~ £(0;2)dVa, 


which we may also approximate, with all its’ polynomial functionals, by 4 
sequence of polynomial chaoses. The use of polynomial approximations 
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tending boundedly to a step function over a finite range will show us that 
this is also true of the chaos determined by 


(158) | = M(P;2). 


By a proper choice of the parameters, this can be made to have arbitrarily 
nearly all its mass uniformly distributed over regions arbitrarily near to 
arbitrarily small spheres, all arbitrarily remote from one another, except in 
an arbitrarily small fraction of the cases. We then form 


(159) (In (Q; a) +8) exp (— ave = 


where 6 is taken to be very small. This chaos again, as far as all its poly- 
nomial functionals are concerned, will be approximable by polynomial chaoses. 
Since it is bounded away from 0 and o, and since over such a range the 
function 1/z may be approximated uniformly by polynomials, it follows that 
in our sense, 


(160) 1/N (P; 


is approximable by polynomial chaoses. 
If w(P) is any measurable function for which arbitrarily high moments 
are always finite, it is easy to show that 


2 


) dVo = 2) 


is approximable by polynomial chaoses. Multiplying expressions (160) and 
(161), it follows that 
(162) = U(P; 2) 


is approximable by polynomial chaoses. 


If A is a large enough constant, depending on the choice of the constant e, 
we have 


1 | P 
(163) (2rk)"/? f, si ) J 


2 OO 
0 
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Thus by the proper choice of the parameters of 9n(P;«), if we take k small 
enough and then 6 small enough, the chaos (162) will consist as nearly as we 
wish, from the distribution standpoint, of an infinite assemblage of convex 
cells of great minimum dimension, in each of which the function w(P) is 
repeated, with the origin moved to some point remote from the boundary. 
Now let %(S;%) be a metrically transitive homogeneous chaos. Let 


us form 


V(r) Jr 


Clearly by the fundamental theorem of the calculus, over any finite region in 


Z1,° ° *,%n), we shall have for almost all points and almost all values of « 
, >In), 
(165) a) =lim 83a); 


and if (144) holds, it is easy to show that 
(166) 1 (738; a)!"da < const. 
From this it follows that 
1 
(167) lim | a) —B(S; a) |"de = 0 
r-0 0 


and by the ergodic theorem, except for a set of values of « of zero measure, 
as r tends to 0 through a denumerable set of values, 


(168) lim | Bar ...,an(73 a) 
roo V(r) Jr 
— Fa, %) - = 0. 


With this result as an aid, enabling us to show that the distribution of 3(S; «) 
is only slightly affected by averaging within a small sphere with a given radius, 
or even within any small region near enough to a small sphere with a given 
radius, we may proceed as in (161) and (162) and form the chaos 


1 
(169) Be(83 2) — 20958) 


(— 
(M(a,,° * + 8) exp dx, + + 


q ( 
( 
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For almost all 8, in each of the large cells of this chaos, (169) will have as 
nearly as we wish the same distribution as some %z,,...,2,(5;a), where 
(71,° * *,%n) lies in the interior of the cell, remote from the boundary. These 
cells may so be determined that except for those filling an arbitrarily small 
proportion of space, all are convex regions with a minimum dimension greater 
than some given quantity. 

To establish (143), it only remains to show that the average of a quantity 
depending on a chaos over a large cell tends to the same limit as its average 
over a large sphere. To show this, we only need to duplicate the argument 
of paragraph 4, where we prove the multidimensional ergodic theorem, for 
large pyramids with the origin as a corner, instead of for large spheres about 
the origin. We may take the shapes and orientations of these pyramids to 
form a denumerable assemblage, from which we may pick a finite assemblage 
which will allow us to approach as closely as we want to any cell for which 
the ratio of the maximum to the minimum distance from the origin within 
it does not exceed a given amount. It is possible to show that by discarding 
cells whose measure is an arbitrarily small fraction of the measure of all space, 


the remaining cells will have this property. 


13. The physical problem. The transformation of a chaos. The 
statistical theory of a homogeneous medium, such as a gas or liquid, or a field 
of turbulence, deals with the problem, given the statistical configuration and 
velocity distribution of the medium at a given initial time, and the dynamical 
laws to which it is subject, to determine the configuration at any future time, 
with respect to its statistical parameters. This of course is not a problem in 
the first instance of the history of the individual system, but of the entire 
ensemble, although in proper cases it is possible to show that almost all sys- 
tems of the ensemble do actually share the same history, as far as certain 
specified statistical parameters are concerned. 

The dynamical transformations of a homogeneous system have the very 
important properties, that they are independent of any choice of origin in 
time or in space. Leaving the time variable out of it, for the moment, the 
simplest space transformations of a homogeneous chaos %(S; a) which have 
this property are the polynomial transformations which turn it into 


Kot f K, (4, — 1, Lo — Yo, Tn — Yn) Bu, mn (S dyn 
(170) 


tn — Yn ) Buca), 
Burry, my a)dyi™ dyn. 
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These are a sub-class of the general class of polynomial transformations 


Ko+ K, (2, °° Yn) Bu, ..-,9n(93 a)dys dyn 


+ f 541 yn ™) 


If a transformation of type (171) is invariant with respect to position in space, 
it must belong to class (170). On the other hand, in space of a finite number 
of dimensions and in any of the ordinary spaces of an infinite number of 
dimensions, polynomials are a closed set of functions, and hence every trans- 
formation may be approximated by a transformation of type (171). 

A polynomial transformation such as (170) of a polynomial chaos yields 
a polynomial chaos. If then we can approximate to the state of a dynamical 
system at time 0 by a polynomial chaos, and approximate to the transformation 
which yields its status at time ¢ by a polynomial transformation, we shall 
obtain for its state at time ¢, the approximation of another polynomial chaos. 
The theory of approximation developed in the last section will enable us to 
show this. 

On the other hand, the transformation of a dynamical system induced 
by its own development is infinitely subdivisible in the time, and except in 
the case of linear transformations, this is not a property of polynomial trans- 
formations. Furthermore, when these transformations are non-linear, they 
are quite commonly not infinitely continuable in time. For example, let us 
consider the differential equation 


(172) +u 
This corresponds to the history of a space-distribution of velocity transferred 
by particles moving with that velocity. Its solutions are determined by the 
equation 
(173) u(z,t) t),0), 


or if y is the inverse function of u(z, 0), 
(174) z—tu(zr,t) =y(u(za,t)). 


Manifestly, if two particles with different velocities are allowed to move long 


| 

| 
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enough to allow their space-time paths to cross, u(x, ¢) will cease to exist as a 
single-valued function. This will always be the case for some value of ¢ if 
u(z,0) is not constant, and for almost all values of ¢ and «@ if it is a poly- 
nomial chaos. 

By Lagrange’s formula, (174) may be inverted into 


n! da" 
In a somewhat generalized sense, the partial sums of this formally represent 
polynomial transformations of the initial conditions. However, it is only for 
a very special. sort of bounded initial function, and for a finite value of the 
time, that they converge. It is only in this restricted sense that the poly- 
nomial transformation represents a true approximation to that given by the 
differential equation. 

It will be seen that the useful application of the theory of chaos to the 
study of particular dynamical chaoses involves a very careful study of the 
existence theories of the particular problems. In many cases, such as that of 
turbulence, the demands of chaos theory go considerably beyond the best 
knowledge of the present day. The difficulty is often both mathematical and 
physical. The mathematical theory may lead inevitably to a catastrophe 
beyond which there is no continuation, either because it is not the adequate 
presentation of the physical facts; or because after the catastrophe the physical 
system continues to develop in a manner not adequately provided for in a 
mathematical formulation which is adequate up to the occurrence of the 
catastrophe ; or lastly, because the catastrophe does really occur physically, and 
the system really has no subsequent history. The hydrodynamical investi- 
gations required in the case of turbulence are directly in the spirit of the work 
of Oseen and Leray, but must be carried much further. 

The study of the history of a mechanical chaos will then proceed as 
follows: we first determine the transformation of the initial conditions gen- 
erated by the dynamics of the ensemble. We then determine under what 
assumptions the initial conditions admit of this transformation for either a 
finite or an infinite interval of time. Then we approximate to the transforma- 
tion for a given range of values of the time by a polynomial transformation. 
Then, having regard to a definition of distance between two functions de- 
termined by the transformation, we approximate to the initial chaos by a 
polynomial chaos. Next we apply the polynomial transformation to the poly- 
nomial chaos, and obtain an approximating polynomial chaos at time tf. 


Finally, we apply our algorithm of the pure chaos to determine the averages 
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of the statistical parameters of this chaos, and express these as functions 
of the time. 

The results of such an investigation belong to a little-studied branch of 
statistical mechanics: the statistical mechanics of systems not in equilibrium. 
To study the classical, equilibrium theory of statistical mechanics by the 
methods of chaos theory is not easy. As yet we lack a method of representing 
all forms of homogeneous chaos, which will tell us by inspection when two 
differ merely by an equimeasure transformation of the parameter of distribu- 
tion. In certain cases, in which the equilibrium is stable, the study of the 
history of a system with an arbitrary initial chaos will yield us for large values 
of ¢ an approximation to equilibrium, but this will often fail to be so, particu- 
larly in the case of differentiable chaoses, or the only equilibrium may be that 
in which the chaos reduces to a constant. 
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ON HILL’S PERIODIC LUNAR ORBIT. 


By AUREL WINTNER. 


Hill’s desideratum. This paper deals with a question concerning the 
mathematical foundations of Hill’s lunar theory, a theory developed in the 
first volume of this Journal’? and considered by Poincaré as representing a 
turning point in the history of celestial mechanics. 

From the mathematical point of view, the most original, but also the 
boldest, step in Hill’s lunar theory consists of abandoning the classical method 
of variation of constants, i.e., Lagrange’s theory of perturbations, and replacing 
it by successive applications of the method of infinitely many variables. Among 
mathematicians, Hill’s name is remembered mainly for his theory of the lunar 
perigee, a theory depending on a linear Eigenwertproblem of specific type and 
leading to an infinite system of homogeneous linear equations. Hill’s formal 
treatment of this problem by means of transcendental entire functions which 
are defined by infinite determinants has already been justified by Poincaré and 
has lead, via the investigations of Fredholm, to Hilbert’s theory of linear trans- 
formations. Thus it is understandable that the introduction of the theory of 
infinite linear systems is generally considered as Hill’s main contribution to 
pure analysis. 

Actually, Hill’s fundamental paper (Joc. cit.) does by no means deal with 
the linear Higenwertproblem which leads to infinite determinants, but rather 
with a non-linear problem which leads to an infinite non-linear and non- 
recursive system of equations for infinitely many unknown Fourier constants ; 
a system which seems to involve steps essentially more delicate than those which 
are implied by the application of infinite determinants in the linear problem. 
Furthermore, the linear problem presupposes that the non-linear problem has 
already been solved. In fact, the linear problem merely concerns the in- 
finitesimal isoenergetic normal displacements of those periodic solutions of the 
non-linear equations of motion which lead to the infinite system of non-linear 


conditions. 


1G. W. Hill, “ Researches in the lunar theory,” American Journal of Mathematics, 
vol. 1 (1878), pp. 5-26, 129-147, 245-260. 

For a detailed presentation of Hill’s programme in lunar theory, cf. pp. 195-236 
(Chap. XI) of E. W. Brown’s Treatise on the Lunar Theory (Cambridge, 1896), where 
further references may be found. 
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Hill was well aware of the novelty of the mathematical problem involved 
in his infinite non-linear implicit system of equations: “I regret that, on 
account of the difficulty of the subject . . . it does not appear that anything 
in the writings of Cauchy will help us to the conditions of convergence ” (loc. 
cit.’, p. 8). On the other hand, not only were Hill’s numerical results in 
perfect agreement with the observed motion of the Moon, but also the terms 
of his expansions, obtained from the infinite non-linear system of equations in 
a formal and approximative manner, have shown, in contrast with the corre- 
sponding expansions of the other lunar theories, a surprisingly rapid decrease 
in absolute value; so that Hill (loc. cit.', p. 8) describes the probable situation 
as follows: “ There cannot be a reasonable doubt that, in all cases, where we 
are compelled to employ infinite series in the solution of a problem, analysis is 
capable of being perfected to the point of showing us within what limits our 
solution is legitimate, and also giving us a limit which its error cannot surpass.” 


The analytical problem. The problem implied by this desideratum of 
Hill was treated by the writer several years ago. The method consisted in 
proving (loc. cit.?, pp. 263-265) a general existence theorem on infinite non- 
linear implicit systems which turned out to be the appropriate tool for dealing 
with Hill’s problem and, at the same time, gave explicit estimates of the error 
terms which, in Hill’s procedure, result by stopping the process of the com- 
parison of coefficients at a certain stage. 

The object of the present paper is to simplify this direct existence and 
convergence proof in such a way as to make it adaptable to sharper estimates. 
For this end, it will be convenient to use an existence theorem on infinite non- 
linear implicit systems which is somewhat more general * than the one used 
loc. cit.*, and takes into account the peculiar situation in orders of magnitude 
which is an essential point in the problem. The proof of this appropriate 
existence theorem hardly differs from the proof of the more particular existence 
theorem which was used loc. cit.? and will, therefore, be omitted. Nevertheless, 
the paper is readable in itself. 

Due to this general existence theorem, the treatment of Hill’s problem 
becomes free of any ad hoc device, proceeds exactly along the lines which were 
followed by Hill in a formal way, and, in addition, takes into account the 
successive decrease of the orders of magnitude; thus proving, and also ex- 
plaining, the convergence of Hill’s process and supplying the realization of 


Hill’s desideratum. 


2A. Wintner, “ Zur Hill’schen Theorie der Variation des Mondes,” Mathematische 
Zeitschrift, vol. 24 (1925), pp. 259-265. 

8’ Cf. A. Wintner, “ Uber die Differentialgleichungen der Himmelsmechanik,” Mathe- 
matische Annalen, vol. 96 (1926), pp. 284-312. 
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It should be mentioned that the question as to the existence of Hill’s 
family of periodic lunar orbits is implied by, but is not identical with, the 
question as to the legitimacy of Hill’s bold method of infinitely many variables. 
In fact, the question as to the existence of these periodic orbits can be settled, 
according to Birkhoff, in a rather simple and intuitive manner.‘ 


The equations of condition. Let x,y be the geocentric codrdinates of the 
Moon in the rotating (i. e., synodic) Cartesian plane of Hill’s restricted problem 
of three bodies. Then, if ¢ denotes the time and m (2 0) an integration con- 
stant which varies in the neighborhood of m= + 0, Hill’s aim (loc. cit.’, 
pp. 129-131) is to find a continuous family of periodic solutions of the form 

+00 


(1) t= a; c0s(2-+ 1)t/m, y= 3 a, sin(2i + 1)t/m, 


where the Fourier coefficients a; are unknown and are to be determined as real 
analytic functions, 
(2) a, = a; (m) (t=0,+1,+ 2,: 


of the integration constant m, which.is the parameter of the family (1). 


One can replace m by the varying period 
(3) T = 2xm (m 2 0) 


of (1) and also by Jacobi’s constant C = C(m), i. e., by the (relative) energy, 
where, however, C(m) > -+ o whether m—-+ 0 or m—>—(0, i.e., whether 
the solution belonging to a small m is direct or retrograde. The numerical 
value, m,, of m in the case of the Moon is, of course, positive and somewhat 
less than Yo: 

(4) My = 0.08084 - 

(cf. loc. cit.?, p. 146). 

Now, Hill’s method of calculating the unknown family (1) of periodic 
solutions, i. e., the infinitely many unknown functions (2), consists simply of 
a direct substitution of (1) into the differential equations of motion and their 
Jacobi integral. This gives, by comparison of the coefficients in the resulting 
Fourier series, infinitely many implicit conditions for the infinitely many 
functions (2). Since the equations of motion are non-linear, so are these 
implicit conditions. In fact, Hill finds after several ingenious transformations 


*G. D. Birkhoff, “The restricted problem of three hodies,” Rendiconti del Circolo 
Matematico di Palermo, vol. 39 (1915), Part III, §12. Cf. also A. Wintner, Sorten- 
genealogie, Hekubakomplex und Gruppenfortsetzung,’ Mathematische Zeitschrift, vol. 
34 (1931), §5 (Fig. 1) and §7. A Russian paper by A. Liapounoff is reviewed in the 
Fortschritte der Mathematik, vol. 26 (1895), p. 1103. 
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that the infinite system of conditions which are to be satisfied by the functions 
(2) consists, on the one hand, of the equation ® 


(5) { {47 + 46+ 1+ 4¢+ 3m?}a; = m? 


and, on the other hand, of the simultaneous infinite system ° 


(6) {[j, + m?[ + =0; 


+ 1, 2,°- -), 


where [j,7], [j], (j) are rational functions of the independent parameter m, 
namely 
& 47° + 49 2 — 4(t—j +1) m + 


(7) [7,1] = j 2(4j7? — 1) —4m m? 
3 47?—8j)—2—4(j + 2)m— 9m? 
(8) [7]= 167? 2(47? — 1) —4m + m? 
(9) (j) —— 16) + 2—4(5j —2)m + 9m? 
16? 2(4j7? —1) —4m + m? 


with the understanding that 7 = 

Accordingly, the problem requires a proof to the effect that the infinite 
system (6), (5) of non-linear and non-recursive equations defines the infinitely 
many unknown functions (2), and that, for a fixed value (3) under considera- 
tion (and, in particular, for the fixed value (4) of m) the series (1) converge 
to functions z= 2(t), y=y(t) which have second derivatives. For then 
(1) is a solution of the equations of motion, since (6), (5) represent all of the 
conditions obtained by the method of undetermined Fourier coefficients. Need- 
less to say, v= z(t), y=y(t) must turn out to be analytic functions of /, 
the equations of motion being analytic. Incidentally, the functions (2) of m 
also prove to be analytic. 


5G. W. Hill, loc. cit.1, p. 145. This condition of Hill, when written in the above 
form (5), has as its right-hand member the number x, instead of m*. Actually, « = m’, 
if one chooses units in the usual way. In fact, Hill’s n’ denotes the constant angular 
velocity of the synodic coérdinate system, while his u is the total mass (loc. cit.’, p. 9). 
Furthermore, m = n’/(n—n’) and k= y4/(n—n’)? is his notation (loc. cit.1, p. 9). 
Hence x = m?, if one chooses, as usual, n’ = 1 and 1. 

* Ibid., p. 135. For reasons which will become apparent in (16) below, Hill’s 
[j]/m* and (j)/m? have been denoted by [j] and (j) respectively, while Hill’s [j, 4] 
is the same as the above [j,i]. 

7 Ibid., pp. 134-135. The factor m* of Hill’s [j] and (j) has been dropped in 
accordance with the preceding footnote. 
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Since the given rational functions (7), (8), (9) of the independent 
parameter m do not contain the unknown functions (2), the portion (6) of 
the complete system of conditions (5), (6) is homogeneous in the a; = a;(m) 
and can, therefore, only determine the ratios 


(10) = a4 : ==ai(m) : ao(m) = bi(m) (t= +1,+2,:-:) 


of the unknown functions (2). If the functions (10) are known, the func- 
tions (2) follow from the remaining condition (5). In fact, the expression 
on the left of (5) is an infinite trilinear form in the a; = a;(m) ; so that, 
if the functions (10) of m are known, a, follows as a function of m by the 
extraction of a cubic root. (Actually, the exponent of the radicand in m turns 
out to be %, instead of being merely 4; and this % expresses the infinitesimal 
validity of Kepler’s third law). 

Accordingly, the infinite system (5), (6) for the unknown functions (2) 
can be replaced by the infinite system (6) for the unknown functions (10). 


A prepared form of the equations of condition. It is seen from (7) 


that 
(11) [j,j] =—1 and [j,0] —0 


for every m. Hence, (6) can be written as 
+00 +00 
i=-OC 4=-00 


+m*(j) + 2m? [7 ]aoaj-1 + 2m? (7) 
i=-00 
(ge 1,2 +); 


where, corresponding to (11), the prime of the first sum means that 


(13,) iAjandtAOVin , (j 1, +); 
while two primes denote that 7 

(132) ix j7—1 andi¥0 in 1,4 3,°- +), 
finally three primes that s 

(13,) iA —j—1 andi+¥0 in (j= 1, 3, - +). 
Since neither of the indices k, 1 of an arbitrary term - - -a,a1 of the three 


sums (13,)-(13,) occurring in (12) vanishes, the homogeneous equations (12) 
for the unknown ratios (10) appear in the form 


11 


+OC 
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4=-00 i=-00 


(7) + 2m*[ + 2m? (7) 
if one divides (12) by a)’. 
Now consider, instead of the ratios (10), the unknowns c; which are 
defined by 
(15) b;(m) =b; miilc;, so that cj; = cj(m) (j= 
Then the j-th of the equations (14) appears, upon division by ml/l*1, in the 
form 


+00 
i=-00 i=-00 
+00 
=-00 
(jm +1,+2,-- 


where the exponents are integers; in fact, corresponding to (13,)-(133), 


(11) 
+00 
while obviously 
(18,:) +1; (182) 
Since 7 == + 1, + 2,- - -, comparison of (17%) with (13.), where v = 1, 2, 3, 
shows that 
(19) the integers Bij, yij, 8j;, are non-negative, 


the inequalities 6; = 0 and ej = 0 being obvious from (18,)-(18.). 


Estimates for the coefficient functions. In what follows, use will be 
made of the notation 


(20) == f*(m) | Cas | m*, if f=f(m) = Cnm” 


is a regular analytic function of the independent variable m in the vicinity 
of the origin m = 0. 

Since 7 0, any of the infinitely many rational functions (7), (8), (9) 
is regular at m0; so that (20) defines [7,7]*, [j7]*, (j)* as power series 
in m with real non-negative coefficients. Now, all these power series are con- 


+00 +00 
1 
é 
{ 
| 
+00 
‘ 
3 
} 
{ 
¥ 
n=0 
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vergent in the circle | m|=1 of the complex m-plane. Furthermore, there 
exists a bound C which is independent of 7 and 7 and is such that 


(21) | < | for | m| 
and 
(22) I[7]* | < C/j#? and |(j)* | < C/j? for | m| 


In fact, if f(m) is regular analytic ina circle | m| < R, and if | f(m)| <M 
in this circle, then, on using the notations (20), one has | ¢n | << M/R" (Cauchy), 
and so, if R > 1, 

| f*(m)| < MR/(R—1) for | m|S1. 


Consequently, (21) is true if there exists two constants M and RF such that 
R> 1 and all of the functions (7) are regular and less than M in absolute 
value in the circle | m| < R. Now, on dividing the numerator and denominator 
of (7) by 2(4j?—1), one sees that M and Rk >1 exist if the rational 
functions 

4m —m? 


where jw +1, +32,:--, 


are regular and uniformly bounded in a circle | m|< &, where R> 1. Finally, 
this condition is satisfied by R = %, since if |m|<1+ ¥, then 


— m?| (1+ 4%) (441+ %) <652(47—1) for +1, + 
This proves (21) ; while (22) follows in the same way from (8), (9) as (21) 


did from (7). 
A consequence of (21) and (22) is that if the variables m ; ¢,, ¢_1, C2, C-2,° °° 


“9 


are restricted by the inequalities 


but are otherwise arbitrary, then the estimates 


(25) ¥ * | << (j=+1,+2, ) 
and 
+0C 
| 3” | | < 
(26) 


|(7)* | x” | | << B/]?, 


hold for a sufficiently large constant B which, in addition, can be chosen such 

that 

(27) 
2 (7)* | < 


e 
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In fact, it is clear from (19) and (21) that, in the domain (24), the 
expression on the left of (25) is less than the product of the constant C and of 
7-7-1 jj) | | -1- — 7). 
i=-0O 4=-00 


Since the first of these sums can be written as 


+00 
(t— j)-? = - const., 
4=-00 


and the second as 


+00 +0 +00 
= | |-1- const. < | j - Const. | 7 Const.//?, 

the proof of (25) is complete. 

Furthermore, (19) and (22) show that, in the domain (24), the ex- 
pression on the left of the first of the inequalities (26) is less than 

(— 1 + —1)? < = const./7?. 


This proves the first of the inequalities (26), while the second follows in the 
same way. 

Finally, (27) is clear from (19), (22) and (24). 

The existence theorem. Without regard to questions of convergence, let 

dn = gn (20, 21° in igs 
where the n non-negative integers 7,,: - -,% need not be distinct, denote a 
form of degree n in the infinitely many variables 2», z,,- - -, and suppose that 
the terms of this n-fold sum have been contracted completely, i.e. that the n 
equalities 1; = ji,° * *,% = jn always imply the vanishing of at least one of 
the two constant coefficients 4;,...i,, 4j,...j,. On this assumption, associate 
with ¢, the unique form 


(so that ¢n=¢*, if and only if alla=0). And let f* denote the unique 


power series 


f* = f* (2, —3 $*,, if f =f (2, 


n=0 


is a power series in the infinitely many variables Zo, 2:,- - - ; so that, corre- 


sponding to (20), 


7 
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n=0 n=0 
Let there be given an infinite sequence 
(28) Yrs Yo, fi 5 Yr, Yo, 
of such power series f (2%, 21, 22,° Where 2, = Y1, 2 


Suppose that there exist two positive constants and two sequences of positive 
constants, say 


(29) asp and My: *, 

such that, on the one hand, 

and, on the other hand, the sequence of power series (28) satisfies the conditions 
(31) Bi, S Mi, (& = 


(which, in particular, imply that each of the power series (28) is absolutely 
convergent in the domain 
of infinitely many dimensions). 

Now (cf. loc. cit.? and loc. cit.*), if the power series (28) in infinitely 
many variables are such that the conditions (30), (31) are satisfied by positive 
constants (29), then the infinite implicit system of equations 


(33) Yc = Yr, Y2,° * (4 =1,2,-- -) 


has in a sufficiently small fixed circle about the origin of the complex z-plane, 


namely at least in the circle 


(34) | «| < min (4, 

which is independent of &, exactly one regular analytic solution yx = yx(@). 
Furthermore, 

(35) | < By, for |x| < min (4, »), =1,2,°°-). 


Finally, every y,(x) is real for real « if the coefficients of the power series (28) 
in infinitely many variables are real. 

The proof of this theorem (loc. cit.? and loc. cit.*) is such as to supply 
not only the existence of the regular functions y,(2) and their estimates (35) 
but also convenient appraisals of the errors which are committed if one only 
carries out the process of the method of undetermined coefficients to a given 


stage. 
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Application of the existence theorem. Now, this existence theorem is 
applicable to the problem at hand, if one identifies 


M5 Cry Co, C-2,° With 91, Ye, Ys, 


respectively. In fact, the rational functions (7), (8), (9) of m are regular 
at m = 0 and can, therefore, be developed according to non-negative powers 
of m=z. Hence it is clear from (19) that the system (16) for the unknown 
functions cj = c;(m) can be written in the form 


(36) = mF ;(m;3 C-1, C2, C_2,° * *) 


where the Fj are given power series in the infinitely many variables m; c; and 
have real coefficients. Furthermore, there exists a constant A > 0 such that 


(37) 1%, 1+. 2-2, 2-2, -) G=t 1, + 


In fact, since (25), (26), (27) hold on the assumption (24), one can choose 
A = 5B, the function F; in (36) being, by its definition, the function { } 
in (16). 

On identifying (36) with (33), one sees from (37) that the conditions 
of the existence theorem, namely (31) and (30), are satisfied by 


a= 1; By —1/j?, My = A/j? (j= +1,+2,---) 
since (30) is then satisfied by »1/A. Thus, the radius of the circle (34), 
where z= m, becomes min (1,1/A) ; while (35) states that | cj(m)| < 1/7’, 
hence | cj(m)| < 1, in this circle. Furthermore, c;(0) = 0, as seen by placing 
m = 0 in (36). 
Since (36) is identical with (16), while (16) is identical with (14) in 
virtue of (15), it follows that the system (14) has in the domain 


(38) |m| <A, where min (1,1/A), (B <1), 
exactly one regular analytic solution (10), and that the Taylor series of the 


infinitely many functions b;(m) are of the form 


(39) b;(m) = mlil*? Binm" G=t 


n=0 


where the constants B;n are real, finally that the power series (29) are in their 
common domain (38) of convergence such that 


(40) | bj (m)| < | m (jue t1,+2,---;|m|<A<1). 


Now, (14) is, in virtue of (10), equivalent to (6), while (5) becomes a 


H 
§ 
f 
ox 
» 
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cubic condition for the single function a) = a)(m), since the ratios (10) are 
the known functions (39). Finally,-direct substitutions readily show that the 
unique real analytic function a)(m) which is thus defined by (5) does not 
vanish for small m 2 0. 

This completes the treatment of Hill’s desideratum concerning (1). In 
fact, do = a)(m) being a known function of m, the coefficients (2) of (1) 
follow from (10) and (39) uniquely, while (40) assures that the series (2) 
are Fourier series of (analytic) periodic functions of t. 


The numerical situation. It should be mentioned that, on assuming in 
a formal way that b;(m) is a power series, it easily follows * from (6) or (14) 
without any reference to an existence theorem regarding (33), that 6;(m) 
must be divisible by mlJ!. Needless to say, this does not dispose of the neces- 
sity of an existence and convergence proof, although |m|< 1. In fact, the 
usual formal treatment ® of (6) does not take into account the possibility that 
not only (40) might not hold on a circle (38) which is independent of 7 but 
also that some or all of the power series (39) might be divergent for every 
m = 0, although b;(m) is, in a formal way, divisible by ml/!. Actually, it is 
clear a priori that if the periodic solution (1) exists at all for a fixed m which 
does not correspond to a collision path, then | a; | < const. J!/! for a suitable 
positive # < 1, since the periodic functions (1) must be analytic in the real 
variable ¢; so that (40) is by no means surprising, if the existence of (1) or 
(2) is assumed. 

Loc. cit.?, an existence theorem concerning (33) was applied to (14) 
instead of to (16), so that the numerical advantages of Hill’s equations of 
condition were not completely utilized. Furthermore, the existence theorem 
applied Joc. cit.2 was a particular case of the one used above, since, instead of 
the general condition (30) used above, it was assumed that all By and all M; 
are equal. This necessitated (loc. cit.2) the use of an absolute constant con- 
nected with a fact which is fundamental in Riemann’s theory of trigono- 
metrical series, namely with the fact that 

+00 

= 
All of this has made the numerical situation quite unfavorable, although it 
has allowed the proof of the fact ?° that the process is valid in a circle (38) 
whose radius is not less than 4». While this circle contains the value (4) 


8 Cf. H. Poincaré, Legons de mécanique céleste, vol. 2, part 2 (1909), pp. 35-36. 

® Cf. H. Poincaré, loc. cit.®, pp. 34-39. 

10 A, Wintner, “ Uber die Konvergenzfragen der Mondtheorie,” Mathematische Zeit- 
schrift, vol. 30 (1929), pp. 211-227. 
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belonging to the case of the Moon, Hill’s numerical evidence indicates that the 
convergence must be much stronger than what one would expect of the close- 
ness of the value (4) to the bound 4». = 0.08333: --, if this bound were 
sharp. Now, for reasons explained before, the method of the present paper 
leads to bounds much better than 4.5. The detailed numerical work to this 
effect is in progress. 

Of course, a bound of this type, no matter how sharp, is necessarily unable 
to answer the important question as to the “ natural termination ” (Strémgren) 
of Hill’s family of periodic orbits.** All that follows from the general theory 
is that such a “ dynamical termination ” exists.12 As far as the problem of 
convergence in the vicinity of m = + 0 is concerned, it is clear that complex 
singularities in m, which have no significance for the problem of termination, 
can act unfavorably. 

Correspondingly, it is, according to Hill (loc. cit.), convenient to carry 
out the expansions, not according to the powers of m but rather according to 
the powers of a linear function m= m(m) of m which vanishes with m and 
is chosen so as to transform those complex singularities in m which are 
formally indicated by (23) into positions which are not near the real axis. 
Clearly, this amounts to what is called, in the theory of divergent series, an 
Euler transformation. For instance, if m is positive and small, one improves 
on the closeness of the complex singularities of the rational functions (23) for 
j=2+1,+2,---, if one replaces m by the expansion parameter 


(41) m’ = m/(1+ m) 


of Delaunay’s lunar theory. It is easy to rewrite the existence proof of the 
present paper from m to this m’ or to Hill’s 7 = m(m), and then to calculate 
the corresponding bounds, thus improved. This numerical work also is in 


progress. 
Needless to say, a transformation of the type (41) is well adapted only 
to the case of direct paths, i.e., to the region (m > 0) of the Moon. In view 
of a result of Birkhoff,1* the initial behavior of the retrograde paths (m < 0), 
which are not excluded in (38), is a much easier problem than the correspond- 


ing behavior of the direct paths. 


THE JOHNS HOPKINS UNIVERSITY. 


11 Cf, A. Wintner, “ Almost periodic functions and Hill’s theory of lunar perigee,” 
American Journal of Mathematics, vol. 59 (1937), pp. 795-802, footnote 12 on p. 798. 

12,4. Wintner, “Beweis des E. Strémgrenschen dynamischen Abschlussprincips,” 
Mathematische Zeitschrift, vol. 34 (1931), pp. 321-349. Cf. G. D. Birkhoff, Sur le 
probleme restreint des trois corps, Second mémoire, chap. III, §5, Annali della R. 
Scuola Normale Superiore di Pisa, serie 2, vol. 5 (1936). 

18G. D. Birkhoff, loc. cit.*, § 18. Cf. also A. Wintner, loc. cit.*, § 8-§ 9. 
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